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FIELD OF THE INVENTION 

The present invention relates to compositions and methods for treating, 
characterizing, and diagnosing cancer. In particular, the present invention provides 
gene expression profiles associated with solid tumor stem cells, as well as novel stem 
15 cell cancer markers useful for the diagnosis, characterization, and treatment of solid 
tumor stem cells. 



BACKGROUND OF THE INVENTION 

Breast cancer is the most common female malignancy in most industrialized 
20 countries, as it is estimated to affect about 10% of the female population during their 
lifespan. Although its mortality has not increased along with its incidence, due to 
earlier diagnosis and improved treatment, it is still one of the predominant causes of 
death in middle-aged women. Despite earlier diagnosis of breast cancer, about 1-5% 
of women with newly diagnosed breast cancer have a distant metastasis at the time of 
25 the diagnosis. In addition, approximately 50% of the patients with local disease who 
are primarily diagnosed eventually relapse with the metastasis. Eighty-five percent of 
these recurrences take place within the first five years after the primary manifestation 
of the disease. 

On presentation, most patients with metastatic breast cancer have only one or 
30 two organ systems involved. As the disease progresses over time, multiple sites 

usually become involved. Indeed, metastases may be found in nearly every organ of 

the body at autopsy. The most common sites of metastatic involvement observed are 

locoregional recurrence* in the fktn and soft tissues of the chest wall, as well as tn 

axilla, and supraclavicular area. The most common site for distant metastasis is the 
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bone (30 - 40% of distant metastasis), followed by lung and liver. Metastatic breast 
cancer is generally considered to be an incurable disease. However, the currently 
available treatment options often prolong the disease-free state and overall survival 
rate, as well as increase the quality of the life. The median survival from the 
5 manifestation of distant metastases is about three years. 

Although great strides have been made understanding the genetic changes that 
lead to cancer (e.g. breast cancer), the lack of reliable tumor assay for de novo human 
cancer cells has hindered the ability to undertand the effects of these mutations at the 
cellular level. Also, the lack of identified cancer markers for solid tumor stem cells 
10 has hindered the development of diagnostics and thereapeutics for cancer patients 

(e.g. breast cancer patients). As such, what is needed is a reliable tumor assay as well 
as the identification of cancer markers for solid tumor stem cells. 

SUMMARY OF THE INVENTION 

1 5 The present invention relates to compositions and methods for treating, 

characterizing, and diagnosing cancer. In particular, the present invention provides 
gene expression profiles associated with solid tumor stem cells, as well as novel stem 
cell cancer markers useful for the diagnosis, characterization, and treatment of solid 
tumor stem cells. 

20 In some embodiments, the present invention provides methods of detecting 

solid tumor stem cells, comprising; a) providing a tissue sample from a subject, and b) 
detecting at least one stem cell cancer marker (e.g., 1, 2, 3, 5, 10, ... etc.) from Tables 
4-8 in the tissue sample under conditions such that the presence or absence of solid 
tumor stem cells in the tissue sample is determined. In particular embodimnets, the 

25 detecting comprises determining the presence of (or absence of), or an expression 

level for the at least one stem cell cancer marker. In other embodiments, the detecting 
comprises detecting mRNA expression of the at least one stem cell cancer marker. In 
particular embodiments, the detecting comprises exposing the stem cell cancer marker 
mRNA to a nucleic acid probe complementary to the stem cell cancer marker mRNA. 

30 In certain embodiments, the detecting comprises detecting polypeptide 

expression of the at least one stem cell cancer marker. In other embodiments, the 
detecting comprises exposing the stem cell cancer marker polypeptide to an antibody 
specific to the stem cell cancer marker polypeptide and detecting the binding of the 
antibody to the stem cell cancer polypeptide. In further embodiments, the subject 
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comprises a human subject. In additional embodiments, the tissue sample comprises 
tumor tissue. In some embodiments, the tumor tissue sample is a post-surgical tumor 
tissue sample (e.g. tumor biopsy). 

In other embodiments, the methods further comprise c) providing a prognosis 
5 to the subject. In some embodiments, the at least one stem cell cancer marker is from 
Table 8. In preferred embodiments, the at least one stem cell cancer marker 
comprises: Bmi-1, eed, easyhl, easyh2, rnf2, yyl, smarcA3, smarcAS, smarcD3, 
smarcEl, mllt3, FZD1, FZD2, FZD3, FZD4, FZD6, FZD7, FZD8, FZD9, FZD10, 
WNT2, WNT2B, WNT3, WNT5A, WNT10B, WNT16, AXIN1, BCL9, MYC, and 
10 (TCF4). 

In particular embodiments, the present invention provides methods for 
reducing the size of a solid tumor (e.g. in research drug screening, or therapeutic 
applications) comprising contacting cells of a solid tumor with a biologically (e.g. 
therapeutically) effective amount of a composition comprising at least one agent 

15 directed against at least one stem cell cancer marker shown in Tables 4-8. In some 
embodiments, the biologically effective amount is an amount sufficient to cause cell 
death of or inhibit proliferation of solid tumor stem cells in the solid tumor. In other 
embodiments, the biologically effective amount is an amount interferences with the 
survival pathyways (e.g. notch related genes) or self-renewal pathaways (e.g. WNT 

20 pathways) of the solid tumor stem cell. 

Examples of solid tumors from which solid tumor stem cells can be isolated or 
enriched for according to the invention include, but are not limited to, sarcomas and 
carcinomas such as, but not limited to: fibrosarcoma, myxosarcoma, liposarcoma, 
chondrosarcoma, osteogenic sarcoma, chordoma, angiosarcoma, endotheliosarcoma, 

25 lymphangiosarcoma, lymphangioendotheliosarcoma, synovioma, mesothelioma, 
Ewing's tumor, leiomyosarcoma, rhabdomyosarcoma, colon carcinoma, pancreatic 
cancer, breast cancer, ovarian cancer, prostate cancer, squamous cell carcinoma, basal 
cell carcinoma, adenocarcinoma, sweat gland carcinoma, sebaceous gland carcinoma, 
papillary carcinoma, papillary adenocarcinomas, cystadenocarcinoma, medullary 

30 carcinoma, bronchogenic carcinoma, renal cell carcinoma, hepatoma, bile duct 
carcinoma, choriocarcinoma, •eminoma, embryonal carcinoma, Wilms 1 tumor, 
cervical cancer, testicular tumor, lung carcinoma, email cell lung carcinoma, bladder 
oaroinoma, apithalial caroinoma, glioma, astrocytoma, madulloblaatoma, 
craniopharyngioma, ependymoma, pinealoma, hemangioblastoma, acoustic neuroma, 
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oligodendroglioma, meningioma, melanoma, neuroblastoma, and retinoblastoma. The 
invention is applicable to sarcomas and epithelial cancers, such as ovarian cancers and 
breast cancers. 

In additional embodiments, the at least one agent is an antibody, peptide or 
5 small molecule. In further embodiments, the antibody, peptide, anti-sense, siRNA, or 
small molecule is directed against an extracellular domain of the at least one stem cell 
cancer marker. In some embodiments, the at least one stem cell cancer marker is 
selected from the group consisting of: Bmi-1, eed, easyhl, easyh2, m£2, yyl, 
smarcA3, smarcA5, smarcD3, smarcEl, mllt3, FZD1, FZD2, FZD3, FZD4, FZD6, 

10 FZD7, FZD8, FZD9, FZD10, WNT2, WNT2B, WNT3, WNT5A, WNT10B, WNT16, 
AXIN1, BCL9, MYC, and (TCF4). 

In other embodiments, the present invention provides methods for reducing 
the size of a solid tumor, comprising contacting cells of a solid tumor with a 
biologically (e.g. therapeutically) effective amount of a composition comprising at 

15 least one agent that modulates the activity of at least one stem cell cancer marker 

shown in Tables 4-8. In some embodiments, the present invention provides methods 
for killing or inhibiting the proliferation of solid tumor stem cells comprising 
contacting the solid tumor stem cells with a biologically effective amount of a 
composition comprising at least one agent targeted to at least one stem cell cancer 

20 marker shown in Tables 4-8. In certain embodiments, the methods further comprise 
identifying the death of or the prevention of the growth of the solid tumor stem cells 
following the contacting. In additional embodiments, the cell death is caused by 
apoptosis. In other embodiments, the biologically effective amount is an amount 
interferences with the survival pathyways (e.g. notch related genes) or self-renewal 

25 pathaways (e.g. WNT pathways) of the solid tumor stem cell. In other embodiments, 
the at least one stem cell cancer marker is selected from the group consisting of: Bmi- 
1, eed, easyhl, easyh2, rnf2, yyl, smarcA3, smarcAS, smarcD3, smarcEl, mllt3, 
FZD1, FZD2, FZD3, FZD4, FZD6, FZD7, FZD8, FZD9, FZD10, WNT2, WNT2B, 
WNT3, WNT5A, WNT10B, WNT16, AXIN1, BCL9, MYC, and (TCF4). 

30 In particular embodiments, the solid tumor stem cells express cell surface 

marker CD44, ESA, or B38.1 . In other embodiments, the solid tumor stem cells fail 
to express at least one LINEAGE marker selected from the group consisting of CD2, 
CD3, CD IO, CD 14, CD16, CD31, CD45, CD64, and CD140b (see,e.g., US Pat. Pub. 
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US20040037815A1, and US200201 19565, both of which are herein incorporated by 
reference). 

In other embodiments, the present invention provides methods for selectively 
targeting a solid tumor stem cell comprising, (a) identifying at least one stem cell 
5 cancer marker from Tables 4-8 present on a solid tumor stem cell; and (b) obtaining 
an agent or set of agents that selectively binds to or regulates the at least one stem cell 
cancer marker. In some embodiments, the agent genetically modifies the solid tumor 
stem cell. In particular embodiments, the agent comprises a bi-specific conjugate. In 
further embodiments, the agent comprises an adenoviral vector. 

1 0 In some embodiments, the present invention provides methods for forming a 

tumor in an animal, comprising: introducing purified solid tumor stem cells (e.g. a cell 
dose of) into an animal, wherein: (a) the solid tumor stem cells are derived from a 
solid tumor; and (b) the solid tumor stem cells are enriched at least 2-fold relative to 
unfractionated tumor cells based on the presence of at least one stem cell cancer 

15 marker in Tables 4-8. In other embodiments, the animal is an immunocompromised 
animal. In certain embodiments, the animal is an immunocompromised mammal, 
such as a mouse (e.g., a nude mouse, SCID mouse, NOD/SCID mouse, Beige/SCID 
mouse; and microglobin deficient NOD/SCID mouse). In particular embodiments, 
the number of cells in the cell dose is between about 100 cells and about 5xl0 5 cells. 

20 In certain embodiments, the present invention provides kits for detecting solid 

tumor stem cells in a subject, comprising: a) a reagent capable of specifically 
detecting at least one stem cell cancer marker from Tables 4-8 in a tissue or cell 
sample from a subject, and, optionally, b) instructions for using the reagent for 
detecting the presence or absence of solid tumor stem cells in the tissue sample. In 

25 further embodiments, the reagent comprises a nucleic acid probe complementary to 
mRNA from the at least one stem cell cancer marker. In other embodiments, the 
reagent comprises an antibody or antibody fragment. 

In some embodiments, the present invention provides methods of screening 
compounds, comprising: a) providing; i) a solid tumor stem cell; and ii) one or more 

30 test compounds; and b) contacting the solid tumor stem cell with the test compound; 
and c) detecting a change in expression of at least one stem cell cancer marker shown 
in Tables 4-8 in the presence of the test compound relative to the absence of the test 
compound. In particular embodiments, the detecting comprises determining an 
expression level for the at least one stem cell cancer marker. In particular 
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embodiments, the detecting comprises detecting mRNA expression of the at least one 
stem cell cancer marker. In some embodiments, the detecting comprises detecting 
polypeptide expression of the at least one stem cell cancer marker. In additional 
embodiments, the solid tumor stem cell is in vitro. In other embodiments, the solid 
5 tumor stem cell is in vivo. In further embodiments, the test compound comprises a 
drug (e.g. small molecule, antibody, antibody-toxin conjugate, siRNA, etc.). 

In some embodiments, the present invention provides compositions 
comprising at least two agents (e.g. small molecule, antibody, antibody-toxin 
conjugate, siRNA, etc.), wherein each of the agents modulates the activity of at least 

10 one stem cell cancer marker shown in Tables 4-8. In additional embodiments, the 
composition comprises at least three agents. 

In particular embodiments, the present invention provides methods of 
distinguishing tumorigenic from non-tumorigenic cancer cells, comprising: detecting 
the presence of p-catenin in a cancer cell such that the localization of p -catenin in the 

1 5 cancer cell is determined to be primarily nuclear or primarily cytoplasmic. In some 
embodiments, the method further comprises identifying the cancer cell as tumorigenic 
if the p -catenin localization is primarily nuclear, or identifying the cancer cell as non- 
tumorigenic if the P -catenin localization is primarily cytoplasmic. 

In certain embodiments, the present invention provides methods of 

20 distinguishing a tumorigenic from a non-tumorigenic cancer cell, comprising; a) 

providing; i) a cancer cell, and ii) a composition comprising an agent configured to 
bind p-catenin; and b) contacting the cancer cell with the composition under 
conditions such that the localization of P -catenin in the cancer cell is determined to 
be primarily nuclear or primarily cytoplasmic, and c) identifying the cancer cell as 

25 tumorigenic if the p -catenin localization is primarily nuclear, or identifying the 

cancer cell as non-tumorigenic if the p -catenin localization is primarily cytoplasmic. 

DESCRIPTION OF THE FIGURES 

Figure 1 shows isolation of tumorigenic cells. 
30 Figure 2 shows the DNA content of tumorigenic and non-tumorigenic 

breast cancer cells. 

Figure 3 shows histology from the CD24 + injection site (a), (20x 

objective magnification) revealed only normal mouse tissue while the CD24' 

6 
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/low injection site (b),.(40x objective magnification) contained malignant 
cells, (c) A representative tumor in a mouse at the CD44 + CD24" /low Lineage* 
injection site, but not at the CD44 + CD24 + Lineage" injection site. T3 cells were 
stained with Papanicolaou stain and examined microscopically (lOOx 
5 objective). Both the non-tumorigenic (c) and tumorigenic (d) populations 

contained cells with a neoplastic appearance, with large nuclei and prominent 
nucleoli. 

Figure 4 shows the phenotypic diversity in tumors arising from 
CD44+CD24-/lowLineage- cells. 
10 Figure 5 shows the expression of Wnt (left panel) and Frizzled (right 

panel). 

Figure 6 shows the isolation of normal tumor fibroblasts and 
endothelial cells. 

Figures 7 shows infection of breast cancer stem cells with an 
15 adenovirus vector. 

Figure 8 shows subcellular localization of P-catenin. 

Figure 9 shows inhibition of P-catenin signaling in cancer cells. 

GENERAL DESCRIPTION 

20 The present invention relates to compositions and methods for treating, 

characterizing and diagnosing cancer. In particular, the present invention provides 
gene expression profiles associated with solid tumor stem cells, as well as novel 
markers useful for the diagnosis, characterization, and treatment of solid tumor stem 
cells. Suitable markers that may be targeted (e.g. for diagnostic or therapeutic 

25 purposes) are the genes and peptides encoded by the genes that are differentially 
expressed in solid tumor stem cells as shown in Tables 4-8. The differentially 
expressed genes, and the peptides encoded thereby, may be detected (e.g. 
quantitatively) in order to identify the presence of solid tumor stem cells, and to 
determine and screen molecules suitable for reducing the proliferation (or killing), 

30 interfering with self-renewal pathways, or interfering with survival pathways of any 
solid tumor stem cells that are present. The differentially expressed genes, and 
peptides encoded thereby, shown in these tables are also useful for generating 
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therapeutic agents targeted to one or more of these markers (e.g. to inhibit or promote 
the activity of the marker). 

In order to identify solid tumor stem cell markers, cells from 5 patients, AML 
stem cells and non-tumorigenic cancer cells from 6 patients, normal hematopoietic 
5 stem cells (HSC5), normal hematopoietic cells, normal colon epithelial cells, and 
normal breast epithelial cells were analyszed for differential expression. 

The present invention also provides solid tumor stem cells that differentially 
express from other cells one or more of the markers provided in Tables 4-8. The solid 
tumor stem cells can be human or other animal. The expression can be either to a 
10 greater extent or to a lesser extent. The other cells can be selected from normal cells, 
hematopoietic stem cells, acute myelogenous leukemia (AML) stem cells, or any 
other class of cells. 

The invention provides a method of selecting cells of a population, which 
results in a purified population of solid tumor stem cells (e.g. from a patient to select 
1 5 or test therapetuic agents are preferred for the patient). The present invention also 

provides a method of selecting a purified population of tumor cells other than solid 

1 

tumor stem cells, such as a population of non-tumorigenic (NTG) tumor cells. The 
present invention provides methods of raising antibodies to the selected cells. The 
invention provides diagnostic methods using the selected cells. The invention also 
20 provides therapeutic methods, where the therapeutic is directed to a solid tumor stem 
cell (e.g. directed to one of the stem cells cancer markers identified herein directly or 
indirectly). 

Accordingly, the invention provides methods of selecting cells, diagnosing 
disease, conducting research studies, and treating solid tumors using selection 

25 methods, diagnostic methods and therapeutics directed to specific genes on a given 
pathway. Included are one or more of the following genes and gene products: Bmi-1, 
eed, easyhi, easyh2, mf2, yyl, smarcA3, smarcA5, smarcD3, smarcE 1 and mllt3, as 
well as those shown in Tables 4-8. Many of these genes are differentially expressed 
in solid tumor stem cells as compared with normal cells and non-tumorigenic cancer 

30 cells, as shown herein. 

The invention provides in vivo and in vitro assays of solid tumor stem cell 
function and cell function by the various populations of cells isolated from a solid 

8 
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tumor. The invention provides methods for using the various populations of cells 
isolated from a solid tumor (such as a population of cells enriched for solid tumor 
stem cells) to identify factors influencing solid tumor stem cell proliferation. By the 
methods of the present invention, one can characterize the phenotypically 
5 heterogeneous populations of cells within a solid tumor. In particular, one can 
identify, isolate, and characterize a phenotypically distinct cell population within a 
tumor having the stem cell properties of extensive proliferation and the ability to give 
rise to all other tumor cell types. Solid tumor stem cells are the tumorigenic cells that 
are capable of re-establishing a tumor following treatment. 

10 The invention thus provides a method for selectively targeting diagnostic or 

therapeutic agents to solid tumor stem cells. The invention also provides an agent, 
such as a biomolecule, that is selectively targeted to solid tumor stem cells (e.g. 
directed to one of the solid tumor stem cell cancer markers disclosed herein). In 
preferred embodiments, the stem cell cancer marker this targeted is part of a self- 

15 renewal or cell survival pathway. One example of such a marker is Bmi-1 , which was 
shown to be required for maintenance of adult self-renewing heamatopoietic stem 
cells (see, e.g., Park et al., Nature, 2003 May 15;423(6937):302-5, herein 
incorporated by reference). 

In certain embodiments, the present invention provides methods for screening 
20 for anti-cancer agents; for the testing of anti-cancer therapies; for the development of 
drugs targeting novel pathways; for the identification of new anti -cancer therapeutic 
targets; the identification and diagnosis of malignant cells in pathology specimens; for 
the testing and assaying of solid tumor stem cell drug sensitivity; for the measurement 
of specific factors that predict drug sensitivity; and for the screening of patients (e.g., 
25 as an adjunct for mammography). 

Other features, objects, and advantages of the invention will be apparent from 
the detailed description below. Additional guidance is provided in Published PCT 
patent application WO 02/12447 by the Regents of the University of Michigan and 
PCT patent application PCT/US02/39191 by the Regents of the University of 
30 Michigan, both of which are incorporated herein by reference. 
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DEFINITIONS 

To facilitate an understanding of the present invention, a number of terms and 
phrases are defined below: 

As used here, the term "antibody" is used in the broadest sense and 
5 specifically covers monoclonal antibodies (including full length monoclonal 
antibodies), polyclonal antibodies, multispecific antibodies (e.g., bispecific 
antibodies), and antibody fragments so long as they exhibit the desired biological 
activity (e.g. able to bind a stem cell cancer marker as described herein). Antibodies 
may be conjugated to other molecules (e.g., toxins). 

10 As used herein, the term "antibody fragments" refers to a portion of an intact 

antibody. Examples of antibody fragments include, but are not limited to, linear 
antibodies; single-chain antibody molecules; Fc or Fc* peptides, Fab and Fab 
fragments, and multispecific antibodies formed from antibody fragments. 

As used herein, "humanized" forms of non-human (e.g., murine) antibodies 

15 are chimeric antibodies that contain minimal sequence, or no sequence, derived from 
non-human immunoglobulin. For the most part, humanized antibodies are human 
immunoglobulins (recipient antibody) in which residues from a hypervariable region 
of the recipient are replaced by residues from a hypervariable region of a non-human 
species (donor antibody) such as mouse, rat, rabbit or nonhuman primate having the 

20 desired specificity, affinity, and capacity. In some instances, Fv framework region 
(FR) residues of the human immunoglobulin are replaced by corresponding non- 
human residues. Furthermore, 1 humanized antibodies may comprise residues that are 
not found in the recipient antibody or in the donor antibody. These modifications are 
generally made to further refine antibody performance. In general, the humanized 

25 antibody will comprise substantially all of at least one, and typically two, variable 
domains, in which all or substantially all of the hypervariable loops correspond to 
those of a nonhuman immunoglobulin and all or substantially all of the FR residues 
are those of a human immunoglobulin sequence. The humanized antibody may also 
comprise at least a portion of an immunoglobulin constant region (Fc), typically that 

30 of a human immunoglobulin. Examples of methods used to generate humanized 

antibodies are described in U.S. Pat. 5,225,539 to Winter et al. (herein incorporated 
by reference). 

"Enriched", as in an enriched population of cells, can be defined 
phenotypically based upon the increased number of cells having a particular marker 

10 
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(e.g. as shown in Tables 4-8) in a fractionated set of cells as compared with the 
number of cells having the marker in the unfractionated set of cells. However, the 
term "enriched can be preferably defined fucntionally by tumorigenic function as the 
minimum number of cells that form tumors at limit dilution frequency in test mice. 
5 For example, if 500 tumor stem cells form tumors in 63% of test animals, but 5000 
unfractionated tumor cells are required to form tumors in 63% of test animals, then 
the solid tumor stem cell population is 10- fold enriched for tumorigenic activity. The 
stem cell cancer markers of the present invention can be used to generate enriched 
populations of cancer stem cells. In preferred embodiments, the stem cell population 

1 0 is enriched at least 1.4 fold relative to unfractioned tumor cells (e.g. 1.4 fold, 1 .5 fold, 
2 fold, 5 fold ....20 fold). 

"Isolated" in regard to cells, refers to a cell that is removed from its natural 
environment (such as in a solid tumor) and that is isolated or separated, and is at least 
about 30%, 50%, 75% free, and most preferably about 90% free, from other cells with 

1 5 which it is naturally present, but which lack the marker based on which the cells were 
isolated. The stem cell cancer markers of the present invention can be used to 
generate isolated populations of cancer stem cells. 

As used herein, the term "receptor binding domain" refers to any native ligand 
for a receptor, including cell adhesion molecules, or any region or derivative of such 

20 native ligand retaining at least a qualitative receptor binding ability of a corresponding 
native ligand. 

As used herein, the term "antibody-immunoadhesin chimera" comprises a 
molecule that combines at least one binding domain of an antibody with at least one 
immunoadhesin. Examples include, but are not limited to, the bispecific CD4-IgG 
25 chimeras described in Berg et al., PNAS (USA) 88:4723-4727 (1991) and Chamow et 
al., J. Immunol., 153:4268 (1994), both of which are hereby incorporated by 
reference. 

As used herein, the terms "cancer" and "cancerous" refer to or describe the 
physiological condition in mammals that is typically characterized by unregulated cell 
30 growth. Examples of cancer include, but are not limited to, carcinoma, lymphoma, 
blastoma, sarcoma, and leukemia. More particular examples of such cancers include 
squamous cell cancer, small-cell lung cancer, non-small cell lung cancer, 
adenocarcinoma of the lung, squamous carcinoma of the lung, cancer of the 
peritoneum, hepatocellular cancer, gastrointestinal cancer, pancreatic cancer, 
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glioblastoma, cervical cancer, ovarian cancer, liver cancer, bladder cancer, hepatoma, 
breast cancer, colon cancer, colorectal cancer, endometrial or uterine carcinoma, 
salivary gland carcinoma, kidney cancer, liver cancer, prostate cancer, vulval cancer, 
thyroid cancer, hepatic carcinoma and various types of head and neck cancer. 

5 The term "epitope" as used herein refers to that portion of an antigen that 

makes contact with a particular antibody. 

When a protein or fragment of a protein is used to immunize a host animal, 
numerous regions of the protein may induce the production of antibodies which bind 
specifically to a given region or three-dimensional structure on the protein; these 

10 regions or structures are referred to as "antigenic determinants". An antigenic 

determinant may compete with the intact antigen (i.e., the "immunogen" used to elicit 
the immune response) for binding to an antibody. 

The terms "specific binding" or "specifically binding" when used in reference 
to the interaction of an antibody and a protein or peptide means that the interaction is 

1 5 dependent upon the presence of a particular structure (i.e., the antigenic determinant 
or epitope) on the protein; in other words the antibody is recognizing and binding to a 
specific protein structure rather than to proteins in general. For example, if an 
antibody is specific for epitope "A," the presence of a protein containing epitope A (or 
free, unlabelled A) in a reaction containing labeled "A" and the antibody will reduce 

20 the amount of labeled A bound to the antibody. 

As used herein, the terms "non-specific binding" and "background binding" 
when used in reference to the interaction of an antibody and a protein or peptide refer 
to an interaction that is not dependent on the presence of a particular structure (i.e, 
the antibody is binding to proteins in general rather that a particular structure such as 

25 an epitope). 

As used herein, the term "subject" refers to any animal (e.g., a mammal), 
including, but not limited to, humans, non-human primates, rodents, and the like, 
which is to be the recipient of a particular treatment. Typically, the terms "subject" 
and "patient" are used interchangeably herein in reference to a human subject. 
30 As used herein, the term "subject suspected of having cancer" refers to a 

subject that presents one or more symptoms indicative of a cancer (e.g., a noticeable 
lump or mass) or is being screened for a cancer (e.g., during a routine physical). A 
subject suspected of having cancer may also have one or more risk factors. A subject 
suspected of having cancer has generally not been tested for cancer. However, a 
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"subject suspected of having cancer" encompasses an individual who has received an 
initial diagnosis but for whom the stage of cancer is not known. The term further 
includes people who once had cancer (e.g., an individual in remission). 

As used herein, the term "subject at risk for cancer" refers to a subject with 

5 one or more risk factors for developing a specific cancer. Risk factors include, but are 
not limited to, gender, age, genetic predisposition, environmental expose, previous 
incidents of cancer, preexisting non-cancer diseases, and lifestyle. 

As used herein, the term "characterizing cancer in subject" refers to the 
identification of one or more properties of a cancer sample in a subject, including but 

0 not limited to, the presence of benign, pre-cancerous or cancerous tissue, the stage of 
the cancer, and the subject's prognosis. Cancers may be characterized by the 
identification of the expression of one or more cancer marker genes, including but not 
limited to, the cancer markers disclosed herein. 

As used herein, the term "stem cell cancer markers" refers to a gene or peptide 

5 expressed by the gene whose expression level, alone or in combination with other 
genes, is correlated with the presence of tumorigenic cancer cells. The correlation 
may relate to either an increased or decreased expression of the gene (e.g. increased or 
decreased levels of mRNA or the peptide encoded by the gene). 

As used herein, the term "a reagent that specifically detects expression levels" 

0 refers to reagents used to detect the expression of one or more genes (e.g., including 
but not limited to, the cancer markers of the present invention). Examples of suitable 
reagents include but are not limited to, nucleic acid probes capable of specifically 
hybridizing to the gene of interest, aptamers, PCR primers capable of specifically 
amplifying the gene of interest, and antibodies capable of specifically binding to 

:5 proteins expressed by the gene of interest. Other non-limiting examples can be found 
in the description and examples below. 

As used herein, the term "detecting a decreased or increased expression 
relative to non-cancerous control" refers to measuring the level of expression of a 
gene (e.g., the level of mRNA or protein) relative to the level in a non-cancerous 

>0 control sample. Gene expression can be measured using any suitable method, 
including but not limited to, those described herein. 

As used herein, the term "detecting a change in gene expression in a a cell 
sample in the presence of said test compound relative to the absence of said test 
compound" refers to measuring an altered level of expression (e.g., increased or 
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decreased) in the presence of a test compound relative to the absence of the test 

compound. Gene expression can be measured using any suitable method. 

As used herein, the term "instructions for using said kit for detecting cancer in 
said subject" includes instructions for using the reagents contained in the kit for the 
5 detection and characterization of cancer in a sample from a subject. 

As used herein, the term "providing a prognosis" refers to providing 
information regarding the impact of the presence of cancer (e.g., as determined by the 
diagnostic methods of the present invention) on a subject's future health (e.g., 
expected morbidity or mortality, the likelihood of getting cancer, and the risk of 
10 metastasis). 

As used herein, the term "post surgical tumor tissue" refers to cancerous tissue 
(e.g., biopsy tissue) that has been removed from a subject (e.g., during surgery). 

As used herein, the term "subject diagnosed with a cancer" refers to a subject 
who has been tested and found to have cancerous cells. The cancer may be diagnosed 

15 using any suitable method, including but not limited to, biopsy, x-ray, blood test, and 
the diagnostic methods of the present invention. 

As used herein, the term "biopsy tissue" refers to a sample of tissue that is 
removed from a subject for the purpose of determining if the sample contains 
cancerous tissue. In some embodiment, biopsy tissue is obtained because a subject is 

20 suspected of having cancer. The biopsy tissue is then examined for the presence or 
absence of cancer. 

As used herein, the term "gene transfer system" refers to any means of 
delivering a composition comprising a nucleic acid sequence to a cell or tissue. For 
example, gene transfer systems include, but are not limited to, vectors (e.g., retroviral, 

25 adenoviral, adeno-associated viral, and other nucleic acid-based delivery systems), 

microinjection of naked nucleic acid, polymer-based delivery systems (e.g., liposome- 
based and metallic particle-based systems), biolistic injection, and the like. As used 
herein, the term "viral gene transfer system" refers to gene transfer systems 
comprising viral elements (e.g., intact viruses, modified viruses and viral components 

30 such as nucleic acids or proteins) to facilitate delivery of the sample to a desired cell 
or tissue. As used herein, the term "adenovirus gene transfer system" refers to gene 
transfer systems comprising intact or altered viruses belonging to the family 
Adenoviridae. 
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As used herein, the term "site-specific recombination target sequences" refers 
to nucleic acid sequences that provide recognition sequences for recombination 
factors and the location where recombination takes place. 

As used herein, the term "nucleic acid molecule" refers to any nucleic acid 
5 containing molecule, including but not limited to, DNA or RNA. The term 

encompasses sequences that include any of the known base analogs of DNA and RNA 
including, but not limited to, 4-acetylcytosine, 8-hydroxy-N6-methyladenosine, 
aziridinylcytosine, pseudoisocytosine, 5-(carboxyhydroxylmethyl) uracil, 5- 
fluorouracil, 5-bromouracil, 5-carboxymethylaminomethyl-2-thiouracil, 
1 0 5-carboxymethylaminomethyluracil, dihydrouracil, inosine, N6-isopentenyladenine, 

1- methyladenine, 1-methylpseudouracil, 1-methylguanine, 1-methylinosine, 2,2- 
dimethylguanine, 2-methyladenine, 2-methylguanine, 3-methylcytosine, 
5-methylcytosine, N6-methyIadenine, 7-methylguanine, 5-methylaminomethyluracil, 
5 -methoxy aminomethy 1-2-thiouracil, beta-D-mannosy lqueosine, 

1 5 S'-methoxycarbonylmethyluracil, 5-methoxyuracil, 

2- methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid methylester, 
uracil-5-oxyacetic acid, oxybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5- 
methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, N-uracil-5-oxyacetic 
acid methylester, uracil-5-oxyacetic acid, pseudouracil, queosine, 2-thiocytosine, and 

20 2,6-diaminopurine. 

The term "gene" refers to a nucleic acid (e.g., DNA) sequence that comprises 
coding sequences necessary for the production of a polypeptide, precursor, or RNA 
{e.g., rRNA, tRNA). The polypeptide can be encoded by a full length coding 
sequence or by any portion of the coding sequence so long as the desired activity or 

25 functional properties (e.g., enzymatic activity, ligand binding, signal transduction, 
immunogenicity, etc.) of the full-length or fragment are retained. The term also 
encompasses the coding region of a structural gene and the sequences located adjacent 
to the coding region on both the 5* and 3' ends for a distance of about 1 kb or more on 
either end such that the gene corresponds to the length of the full-length mRNA. 

30 Sequences located 5* of the coding region and present on the mRNA are referred to as 
5' non-translated sequences. Sequences located 3 ! or downstream of the coding region 
and present on the mRNA are referred to as 3' non-translated sequences. The term 
"lint" tneompaiMi both qDNA Mid § •nomio formi of a |tm, A gmomie form or 
clone of a gene contains the coding region interrupted with non-coding sequences 
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termed "introns" or "intervening regions" or "intervening sequences." Introns are 
segments of a gene that are transcribed into nuclear RNA (hnRNA); introns may 
contain regulatory elements such as enhancers. Introns are removed or "spliced out" 
from the nuclear or primary transcript; introns therefore are absent in the messenger 
5 RNA (mRNA) transcript. The mRNA functions during translation to specify the 
sequence or order of amino acids in a nascent polypeptide. 

As used herein, the term "heterologous gene" refers to a gene that is not in its 
natural environment. For example, a heterologous gene includes a gene from one 
species introduced into another species. A heterologous gene also includes a gene 

10 native to an organism that has been altered in some way (e.g., mutated, added in 

multiple copies, linked to non-native regulatory sequences, etc). Heterologous genes 
are distinguished from endogenous genes in that the heterologous gene sequences are 
typically joined to DNA sequences that are not found naturally associated with the 
gene sequences in the chromosome or are associated with portions of the chromosome 

15 not found in nature (e.g., genes expressed in loci where the gene is not normally 
expressed). 

As used herein, the term "gene expression" refers to the process of converting 
genetic information encoded in a gene into RNA (e.g, mRNA, rRNA, tRNA, or 
snRNA) through "transcription" of the gene (e.g., via the enzymatic action of an RNA 

20 polymerase), and for protein encoding genes, into protein through "translation" of 
mRNA. Gene expression can be regulated at many stages in the process. "Up- 
regulation" or "activation" refers to regulation that increases the production of gene 
expression products (e.g., RNA or protein), while "down-regulation" or "repression" 
refers to regulation that decrease production. Molecules (e.g., transcription factors) 

25 that are involved in up-regulation or down-regulation are often called "activators" and 
"repressors," respectively. 

In addition to containing introns, genomic forms of a gene may also include 
sequences located on both the 5' and 3' end of the sequences that are present on the 
RNA transcript. These sequences are referred to as "flanking" sequences or regions 

30 (these flanking sequences are located 5' or 3' to the non-translated sequences present 
on the mRNA transcript). The 5 1 flanking region may contain regulatory sequences 
such as promoters and enhancers that control or influence the transcription of the 
gene. The 3' flanking region may contain sequences that direct the termination of 
transcription, post-transcriptional cleavage and polyadenylation. 
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The term "siRNAs" refers to short interfering RNAs. In some embodiments, 
siRNAs comprise a duplex, or double-stranded region, of about 18-25 nucleotides 
long; often siRNAs contain from about two to four unpaired nucleotides at the 3' end 
of each strand. At least one strand of the duplex or double-stranded region of a 

5 siRNA is substantially homologous to or substantially complementary to a target 
RNA molecule. The strand complementary to a target RNA molecule is the 
"antisense strand;" the strand homologous to the target RNA molecule is the "sense 
strand," and is also complementary to the siRNA antisense strand. siRNAs may also 
contain additional sequences; non-limiting examples of such sequences include 
0 linking sequences, or loops, as well as stem and other folded structures. siRNAs 
appear to function as key intermediaries in triggering RNA interference in 
invertebrates and in vertebrates, and in triggering sequence-specific RNA degradation 
during posttranscriptional gene silencing in plants. 

The term "RNA interference" or "RNAi" refers to the silencing or decreasing 
5 of gene expression by siRNAs. It is the process of sequence-specific, post- 
transcriptional gene silencing in animals and plants, initiated by siRNA that is 
homologous in its duplex region to the sequence of the silenced gene. The gene may 
be endogenous or exogenous to the organism, present integrated into a chromosome 
or present in a transfection vector that is not integrated into the genome. The 

20 expression of the gene is either completely or partially inhibited. RNAi may also be 
considered to inhibit the function of a target RNA; the function of the target RNA 
may be complete or partial. 

As used herein, the terms "nucleic acid molecule encoding," "DNA sequence 
encoding," and "DNA encoding" refer to the order or sequence of 

25 deoxyribonucleotides along a strand of deoxyribonucleic acid. The order of these 
deoxyribonucleotides determines the order of amino acids along the polypeptide 
(protein) chain. The DNA sequence thus codes for the amino acid sequence. 

As used herein, the terms "an oligonucleotide having a nucleotide sequence 
encoding a gene" and "polynucleotide having a nucleotide sequence encoding a gene," 

30 means a nucleic acid sequence comprising the coding region of a gene or in other 

words the nucleic acid sequence that encodes a gene product. The coding region may 

be present in a cDNA, genomic DNA or RNA form. When present In a DNA fbrm, 
the oligonucleotide or polynucleotide may be single-stranded (i.e., the sense strand) or 
double-stranded. Suitable control elements such as enhancers/promoters, splice 



17 



WO 2005/005601 



PCT/US2004/018266 



junctions, polyadenylation signals, etc. may be placed in close proximity to the coding 
region of the gene if needed to permit proper initiation of transcription and/or correct 
processing of the primary RNA transcript. Alternatively, the coding region utilized in 
the expression vectors of the present invention may contain endogenous 

5 enhancers/promoters, splice junctions, intervening sequences, polyadenylation 

signals, etc. or a combination of both endogenous and exogenous control elements. 

As used herein the term "portion" when in reference to a nucleotide sequence 
(as in "a portion of a given nucleotide sequence") refers to fragments of that sequence. 
The fragments may range in size from four nucleotides to the entire nucleotide 
10 sequence minus one nucleotide (10 nucleotides, 20, 30, 40, 50, 100, 200, etc.). 

The terms "in operable combination," "in operable order," and "operably 
linked" as used herein refer to the linkage of nucleic acid sequences in such a manner 
that a nucleic acid molecule capable of directing the transcription of a given gene 
and/or the synthesis of a desired protein molecule is produced. The term also refers to 
15 the linkage of amino acid sequences in such a manner so that a functional protein is 
produced. 

The term "isolated" when used in relation to a nucleic acid, as in "an isolated 
oligonucleotide" or "isolated polynucleotide" refers to a nucleic acid sequence that is 
identified and separated from at least one component or contaminant with which it is 

20 ordinarily associated in its natural source. Isolated nucleic acid is such present in a 
form or setting that is different from that in which it is found in nature. In contrast, 
non-isolated nucleic acids as nucleic acids such as DNA and RNA found in the state 
they exist in nature. For example, a given DNA sequence (e.g., a gene) is found on 
the host cell chromosome in proximity to neighboring genes; RNA sequences, such as 

25 a specific mRNA sequence encoding a specific protein, are found in the cell as a 

mixture with numerous other mRNAs that encode a multitude of proteins. However, 
isolated nucleic acid encoding a given protein includes, by way of example, such 
nucleic acid in cells ordinarily expressing the given protein where the nucleic acid is 
in a chromosomal location different from that of natural cells, or is otherwise flanked 

30 by a different nucleic acid sequence than that found in nature. The isolated nucleic 
acid, oligonucleotide, or polynucleotide may be present in single-stranded or double- 
stranded form. When an isolated nucleic acid, oligonucleotide or polynucleotide is to 
be utilized to express a protein, the oligonucleotide or polynucleotide will contain at a 
minimum the sense or coding strand (i.e., the oligonucleotide or polynucleotide may 
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be single-stranded), but may contain both the sense and anti-sense strands {i.e., the 
oligonucleotide or polynucleotide may be double-stranded). 

"Amino acid sequence" and terms such as "polypeptide" or "protein" are not 
meant to limit the amino acid sequence to the complete, native amino acid sequence 
5 associated with the recited protein molecule. 

The term "native protein" as used herein to indicate that a protein does not 
contain amino acid residues encoded by vector sequences; that is, the native protein 
contains only those amino acids found in the protein as it occurs in nature. A native 
protein may be produced by recombinant means or may be isolated from a naturally 
0 occurring source. 

As used herein the term "portion" when in reference to a protein (as in "a 
portion of a given protein") refers to fragments of that protein. The fragments may 
range in size from four amino acid residues to the entire amino acid sequence minus 
one amino acid. 

5 The term "Southern blot," refers to the analysis of DNA on agarose or 

acrylamide gels to fractionate the DNA according to size followed by transfer of the 
DNA from the gel to a solid support, such as nitrocellulose or a nylon membrane. 
The immobilized DNA is then probed with a labeled probe to detect DNA species 
complementary to the probe used. The DNA may be cleaved with restriction enzymes 

0 prior to electrophoresis. Following electrophoresis, the DNA may be partially 

depurinated and denatured prior to or during transfer to the solid support. Southern 
blots are a standard tool of molecular biologists (J. Sambrook et aL, Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Press, NY, pp 9.31-9.58 [1989]). 
The term "Northern blot," as used herein refers to the analysis of RNA by 

15 electrophoresis of RNA on agarose gels to fractionate the RNA according to size 

followed by transfer of the RNA from the gel to a solid support, such as nitrocellulose 
or a nylon membrane. The immobilized RNA is then probed with a labeled probe to 
detect RNA species complementary to the probe used. Northern blots are a standard 
tool of molecular biologists (J. Sambrook, et aL, supra, pp 7.39-7.52 [1989]). 

\Q The term "Western blot" refers to the analysis of protein(s) (or polypeptides) 

immobilized onto a support such as nitrocellulose or a membrane. The proteins are 
run on acrylamide gels to separate the proteins, followed by transfer of the protein 
from the gel to a solid support, such as nitrocellulose or a nylon membrane. The 
immobilized proteins are then exposed to antibodies with reactivity against an antigen 
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of interest. The binding of the antibodies may be detected by various methods, 
including the use of radiolabeled antibodies. 

The term "transgene" as used herein refers to a foreign gene that is placed into 
an organism by, for example, introducing the foreign gene into newly fertilized eggs 
5 or early embryos. The term "foreign gene" refers to any nucleic acid (e.g., gene 
sequence) that is introduced into the genome of an animal by experimental 
manipulations and may include gene sequences found in that animal so long as the 
introduced gene does not reside in the same location as does the naturally occurring 
gene. 

10 As used herein, the term "vector" is used in reference to nucleic acid 

molecules that transfer DNA segment(s) from one cell to another. The term "vehicle" 
is sometimes used interchangeably with "vector." Vectors are often derived from, 
plasmids, bacteriophages, or plant or animal viruses. 

The term "expression vector" as used herein refers to a recombinant DNA 

1 5 molecule containing a desired coding sequence and appropriate nucleic acid 

sequences necessary for the expression of the operably linked coding sequence in a 
particular host organism. Nucleic acid sequences necessary for expression in 
prokaryotes usually include a promoter, an operator (optional), and a ribosome 
binding site, often along with other sequences. Eukaryotic cells are known to utilize 

20 promoters, enhancers, and termination and polyadenylation signals. 

The terms "overexpression" and "overexpressing" and grammatical 
equivalents, are used in reference to levels of mRNA to indicate a level of expression 
approximately 1.5-fold higher (or greater) than that observed in a given tissue in a 
control or non-transgenic animal. Levels of mRNA are measured using any of a 

25 number of techniques known to those skilled in the art including, but not limited to 
Northern blot analysis. Appropriate controls are included on the Northern blot to 
control for differences in the amount of RNA loaded from each tissue analyzed {e.g., 
the amount of 28S rRNA, an abundant RNA transcript present at essentially the same 
amount in all tissues, present in each sample can be used as a means of normalizing or 

30 standardizing the mRNA-specific signal observed on Northern blots). The amount of 
mRNA present in the band corresponding in size to the correctly spliced transgene 
RNA is quantified; other minor species of RNA which hybridize to the transgene 
probe are not considered in the quantification of the expression of the transgenic 
mRNA. 
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As used herein, the term "in vitro" refers to an artificial environment and to 
processes or reactions that occur within an artificial environment. In vitro 
environments can consist of, but are not limited to, test tubes and cell culture. The 
term "in vivo" refers to the natural environment (e.g., an animal or a cell) and to 

5 processes or reaction that occur within a natural environment. 

The terms "test compound" and "candidate compound" refer to any chemical 
entity, pharmaceutical, drug, and the like that is a candidate for use to treat or prevent 
a disease, illness, sickness, or disorder of bodily function (e.g., cancer). Test 
compounds comprise both known and potential therapeutic compounds. A test 

0 compound can be determined to be therapeutic by screening using the screening 

methods of the present invention. In some embodiments of the present invention, test 
compounds include antisense compounds. 

As used herein, the term "sample" is used in its broadest sense. In one sense, 
it is meant to include a specimen or culture obtained from any source, as well as 

5 biological and environmental samples. Biological samples may be obtained from 
animals (including humans) and encompass fluids, solids, tissues, and gases. 
Biological samples include blood products, such as plasma, serum and the like. 
Environmental samples include environmental material such as surface matter, soil, 
water, crystals and industrial samples. Such examples are not however to be 

0 construed as limiting the sample types applicable to the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides compositions and methods for treating, 
characterizing, and diagnosing cancer. In particular, the present invention provides 
5 gene expression profiles associated with solid tumor stem cells, as well as novel 

markers useful for the diagnosis, characterization, and treatment of solid tumor stem 
cells. 

I. Stem Cells and Solid Tumor Stem Cells 
0 Common cancers arise in tissues that contain a large sub-population of 

proliferating cells that are responsible for replenishing the short-lived mature cells. In 
such organs, cell maturation is arranged in a hierarchy in which a rare population of 
stem cells give rise to the mature cells and perpetuate themselves through a process 
called self renewal 1 " 11 . Due to their rarity, stem cells should be isolated in order to 
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study their biological, molecular, and biochemical properties. Although it is likely 
that they give rise to most tissues, stem cells have been rigorously identified and 
purified in only a few tissues. The stem cells that give rise to the lympho- 
hematopoietic system, called hematopoietic stem cells (HSCs), have been isolated 
5 from mice and humans and are the best characterized stem cells. The utility of tissue 
containing HSCs has been demonstrated in cancer therapy with their extensive use for 
bone marrow transplantation to regenerate the hematolymphoid system following 
myeloablative protocols' 2 . The prospective isolation of HSCs from patients can result 
in a population that is cancer free for autologous transplantation 13 " 17 . 
1 0 Understanding the cellular biology of the tissues in which cancers arise, and 

specifically of the stem cells residing in those tissues, provides new insights into 
cancer biology. Several aspects of stem cell biology are relevant to cancer. First, 
both normal stem cells and cancer stem cells undergo self-renewal, and emerging 
evidence suggests that similar molecular mechanisms regulate self-renewal in normal 
1 5 stem cells and their malignant counterparts. Next, it is quite likely that mutations that 
lead to cancer accumulate in normal stem cells. Finally, it is likely that tumors 
contain a "cancer stem cell" population with indefinite proliferative potential that 
drives the growth and metastasis of tumors 18 " 26 . 

HSCs are the most studied and best understood somatic stem cell population'. 
20 Hematopoiesis is a tightly regulated process in which a pool of hematopoietic stem 
cells eventually gives rise to the lymphohematopoietic system consisting of the 
formed blood elements, e.g., red blood cells, platelets, granulocytes, macrophages, 
and B-and T-lymphocytes. These cells are important for oxygenation, prevention of 
bleeding, immunity, and infections, respectively. In the adult, HSCs have two 
25 fundamental properties. First, HSCs need to self-renew in order to maintain the stem 
cell pool; the total number of HSCs is under strict genetic regulation 27 . Second, they 
must undergo differentiation to maintain a constant pool of mature cells in normal 
conditions, and to produce increased numbers of a particular lineage in response to 
stresses such as bleeding or infection. 
30 In the hematopoietic system, multipotent cells constitute 0.05% of mouse bone 

marrow cells and are heterogeneous with respect to their ability to self-renew. There 
are three different populations of multipotent cells: long-term self-renewing HSCs, 
short-term self-renewing HSCs, and multipotent progenitors without detectable self- 
renewal potential 7,28 . These populations form a hierarchy in which the long-term 
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HSCs give rise to short-term HSCs, which in turn give rise to multipotent progenitors 
[Fig. 1 in 7 ]. As HSCs mature from the long-term self-renewing pool to multipotent 
progenitors they become more mitotically active but lose the ability to self-renew. 
Only long-term HSCs can give rise to mature hematopoietic cells for the lifetime of 
5 the animal, while short-term HSCs and multipotent progenitors reconstitute lethally 
irradiated mice for less than eight weeks 7 . 

Despite the fact that the phenotypic and functional properties of mouse and 
human HSCs have been extensively characterized 2 , our understanding of the 
fundamental stem cell property, self-renewal, is minimal 25,29,30 . In most cases, HSCs 

1 0 differentiate when exposed to combinations of growth factors that can induce 

extensive proliferation in long-term cultures 31 . Although recent progress has been 
made in identifying culture conditions that maintain HSC activity in culture for a 
limited period of time [for example see Miller and Graves 32 ], it has proven to be 
exceedingly difficult to identify tissue culture conditions that promote a significant 

1 5 and prolonged expansion of progenitors with transplantable HSC activity. 

Maintenance of a tissue or a tumor is determined by a balance of proliferation 
and cell death 33 . In a normal tissue, stem cell numbers are under tight genetic 
regulation resulting in maintenance a constant number of stem cells in the 
organ 27,34,35 . By contrast, cancer cells have escaped this homeostatic regulation and 

20 the number of cells within a tumor that have the ability to self renew is constantly 
expanding, resulting in the inevitable growth of the tumor. As would be expected, 
many of the mutations that drive tumor expansion regulate either cell proliferation or 
survival. For example, the prevention of apoptosis by enforced expression of the 
oncogene Bcl-2 promotes the development of lymphoma and also results in increased 

25 numbers of HSCs in vivo, suggesting that cell death plays a role in regulating the 

homeostasis of HSCs 36,37 . In fact, the progression to experimental acute myelogenous 
leukemia in mice requires at least 3, and likely 4 independent events to block the 
several intrinsically triggered and extrinsically induce programmed cell death 
pathways of myeloid cells 38 . Proto-oncogenes such as c-myb and c-myc that drive 

30 proliferation of tumor cells are also essential for HSCs development 39 " 42 . 

Since cancer cells and normal stem cells share the ability to self-renew, it is 
not surprising that a number of genes classically associated with cancer may also 
regulate normal stem cell development [reviewed in 25,43 ]. In combination with other 
growth factors, Shh signaling has also been implicated in the regulation of self- 
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renewal by the finding that cells highly enriched for human hematopoietic stem cells 
(CD34 + Lin CD38") exhibited increased self-renewal in response to Shh stimulation in 
vitro 44 . Several other genes related to oncogenesis have been shown to be important 
for stem cell function. For example, mice deficient for tal-l/SCL, which is involved 
5 in some cases of human acute leukemia, lack embryonic hematopoiesis 45 , suggesting 
that it is required for intrinsic or extrinsic events necessary to initiate hematopoiesis, 
for maintenance of the earliest definitive blood cells, or for the decision to form blood 
cells downstream of embryonic HSCs 45,46 . Members of the Hox family have also 
been implicated in human leukemia. Enforced expression of HoxB4 can affect stem 

10 cell functions 47,48 . One of the major targets of the p53 tumor suppressor gene is 
p21 c,pl . Bone marrow from p21 c,pl deficient mice has a reduced ability to serially 
reconstitute lethally irradiated recipients. Failure at serial transfer could result from 
exhaustion of the stem cell pool, loss of telomeres, or loss of transplantability 49 . In 
mice, bmi-1, a gene that cooperates with c-myc to induce lymphoma 50,51 , is required 

15 for the maintenance of adult HSCs and leukemia cells. Thus, many genes involved in 
stem cell fate decisions are also involved in malignant transformation. 

Two other signaling pathways implicated in oncogenesis in both mice and 
humans, the Wnt/p-catenin and Notch pathways, may play central roles in the self- 
renewal of both normal and cancer stem cells. The Notch family of receptors was 

20 first identified in Drosophila and has been implicated in development and 

differentiation 52 . In C eiegans, Notch plays a role in germ cell self renewal 53 . In 
neural development transient Notch activation initiates an irreversible switch from 
neurogenesis to gliogenesis by embryonic neural crest stem cells 10 . Notch activation 
of HSCs in culture using either of the Notch ligands Jagged- 1 or Delta transiently 

25 increased primitive progenitor activity that could be observed in vitro and in vivo t 
suggesting that Notch activation promotes either the maintenance of progenitor cell 
multipotenitality or HSC self-renewal 54,55 . While the Notch pathway plays a central 
role in development and the mouse int-3 oncogene is a truncated Notch4 56 , the role 
for Notch in de novo human cancer is complex and less well understood. Various 

30 members of the Notch signaling pathway are expressed in cancers of epithelial origin 
and activation by Notch by chromosomal translocation is involved in some cases of 
leukemia 57-61 . Microarray analysis has shown that members of the Notch pathway 
are often over-expressed by tumor cells 58,59 . A truncated Notch4 mRNA is expressed 
by some breast cancer cell lines 62 . Overexpression of Notchl leads to growth arrest 
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of a small cell lung cancer cell line, while inhibition of Notchl signals can induce 
leukemia cell lines to undergo apoptosis 52,54,63 . Work by Miele and colleagues 
showed that activation of Notch- 1 signaling maintains the neoplastic phenotype in 
Ras-transformed human cells 64 . They also found that in de novo cancers, cells with an 
5 activating Ras mutation also demonstrated increased expression of Notch- 1 and 
Notch-4. 

Wnt/p-catenin signaling also plays a pivotal role in the self-renewal of normal 
stem cells and malignont transformation 65 " 67 . The Wnt pathway was first implicated in 
MMTV-induced breast cancer where in deregulated expression of Wnt- 1 due to 

0 proviral insertion resulted in mammary tumors 68,69 . Subsequently, it has been shown 
that Wnt proteins play a central role in pattern formation. Wnt-1 belongs to large 
family of highly hydrophobic secreted proteins that function by binding to their 
cognate receptors, members of the Frizzled and low-density lipoprotein receptor- 
related protein families, resulting in activation of P-catenin 43 ' 58 » 65 « 70 ' 71 , i n the absence 

5 of receptor activation, p-catenin is marked for degradation by a complex consisting of 
the Adenomatous Polyposis Coli (APC), Axin and glycogen synthase kinase-3P 
proteins 58,67,72 ' 74 .) 66,75 . Wnt proteins are expressed in the bone marrow, and activation 
of Wnt/p-catenin signaling by Wnt proteins in vitro or by expression of a 
constitutively active P-catenin expands the pool of early progenitor cells and enriched 

0 normal transplantable hematopoietic stem cells in tissue culture and in vivo 25,67,72 . 
Inhibition of Wnt/p-catenin by ectopic expression of Axin, an inhibitor of P-catenin 
signaling, leads to inhibition of stem cell proliferation both in vitro and in vivo. Other 
studies suggest that the Wnt/p-catenin pathway mediates stem or progenitor cell self- 
renewal in other tissues 73,74,76,77 . Higher levels of p-catenin are seen in keratinocytes 

5 with higher proliferative potential than those seen in keratinocytes with lower 

proliferative capacity 73,74 ' 77 . Like their normal hematopoietic stem cell counterparts, 
enforced expression of an activated P-catenin increased the ability of epidermal stem 
cells to self renew and decreased their ability to differentiate. Mice that fail to express 
TCF-4, one of the transcription factors that is activated when bound to p-catenin, soon 

0 exhaust their undifferentiated crypt epithelial progenitor cells, further suggesting that 
Wnt signaling is involved in the self renewal of epithelial stem cells 43,76 . 

Activation of P-catenin in colon cancer by inactivation of the protein 
degradation pathway, most frequently by mutation of APC, is common 43,58 ' 66,75 . 
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Expression of certain Wnt genes is elevated in some other epithelial cancers 
suggesting that activation of p-catenin is secondary to ligand activation in such 
cancers 65 * 78 " 83 . There is evidence that constitutive activation of the Wnt/p-catenin 
pathway may confer a stem/progenitor cell phenotype to cancer cells. Inhibition of (3- 
5 catenin/TCF-4 in a colon cancer cell line induced the expression of the cell cycle 
inhibitor p21 cip " 1 and induced the cells to stop proliferating and to acquire a more 
differentiated phenotype 83 . Enforced expression of the proto-oncogene c-myc, which 
is transcriptionally activated by p-catenin/TCF-4, inhibited the expression of p21 c,p ~ 1 
and allowed the colon cancer cells to proliferate when P-catenin/TCF-4 signaling was 

10 blocked, linking Wnt signaling to c-myc in the regulation of cell proliferation and 

differentiation. Although many studies have implicated the Wnt/p-catenin pathway in 
breast cancer, activating mutations of p-catenin are rare in this disease and no studies 
have definitively linked this pathway to human breast cancer 84 " 89 . 

The implication of roles for genes like Notch, Wnt, c-myc and Shh in the 

15 regulation of self-renewal of HSCs and perhaps of stem cells from multiple tissues 
suggests that there may be common self-renewal pathways in many types of normal 
somatic stem cells and cancer stem cells. It is important to identify the molecular 
mechanisms by which these pathways work and to determine whether the pathways 
interact to regulate the self-renewal of normal stem cells and cancer cells. 

20 The Wnt pathway is involved in the self-renewal of normal stem cells and 

activating mutations of Wnt induce breast cancer in mice. This pathway plays a role 
in tumor formation by human breast cancer stem cells isolated from some patients. 
Furthermore, evidence suggests that the ability of different populations of breast 
cancer cells to form tumors differs. Interestingly, the expression of members of the 

25 Wnt/Frizzled/p-catenin pathway are heterogeneously expressed by different 

populations of cancer cells and expression of particular members of the pathway may 
correlate with the capacity to form tumors. 

The different populations of cancer cells and tumor cells drive the 
proliferation of breast cancer cells. Activated p-catenin is seen in the cancer cells in a 

30 significant number of patients. The tumors that contain cancer cells with this pathway 
constitutively active behave differently than those without constitutively activated p- 
catenin. 
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II. Xenograft model of human breast cancer 

Although cell lines have led to remarkable advances in our understanding of 
the molecular and biochemical changes in cancer cells, their use in the identification 
of effective cancer therapies is somewhat limited 90,91 . Cell lines are imperfect 

5 predictors of drug efficacy in de novo tumors 90,91 . Several factors likely account for 
this deficiency. Cancer cell lines are selected from a sub-population of cancer cells 
that are specifically adapted to growth in tissue culture and the biological and 
functional properties of these cell lines can change dramatically 92 ' 95 . Furthermore, 
cancer cells from only a minority of breast cancer tumors establish cell lines or 

0 xenograft tumors 96,97 . The phenotypic and functional characteristics of these cell lines 
can change drastically relative to their properties in vivo 94 . For example, the marker 
expression of both normal hematopoietic and leukemic tissue culture cells can change 
rapidly in tissue culture and often does not reflect that of the original stem cells from 
which they were derived 92 ' 94,95,98 . Even when conditions are devised to permit the 

5 proliferation of normal stem cells in culture, the conditions often promote self- 
renewal or differentiation in a way that prevents the stem cells in culture from 
recapitulating the hierarchy of cell populations that exist in vivo. Taken together, 
these observations suggest that the biological properties of cancer cell lines can differ 
markedly from the cancer cells from which they were derived. This likely explains at 

0 least in part why the cell lines often are poor predictors of a drug's efficacy in the 
clinic. 

Thus, the lack of an effective method to consistently grow primary human 
breast cancer cells in vitro or in vivo for long periods of time has severely limited our 
ability to understand the biology of this disease. The most efficient xenograft models 

:5 report the engraftment of pieces of breast cancer tumors in the ovarian, but not 

mammary, fat pad of SCDD mice approximately 60-75% of the time 99 . Engraftment 
of dissociated cells is not possible in this model, and cancer cells isolated from pleural 
effusions only form tumors in immunodeficient mice approximately 10% of the 
time 90 . The present invention (see Example 1 below) provides a xenograft model in 

10 which one is able to establish tumors from primary breast tumors via injection of 

tumors in the mammary gland of severely immunodeficient mice. These Xenograft of 
the present invention allows one to do biological and molecular tests to characterize 

the clonogenic brent cancer cell ai will ai ether cell types. Importantly, the 

Ktnos* »ft tumon dtv»loptd in •opordano* with th« protm invention contain tat 
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phenotypically diverse cancer cell types found in the human tumors from which they 
were derived and the different populations of cancer cells differ markedly in their 
ability to form tumors 100 . 

The development of an efficient xenograft model in accordance with the 
5 present invention (see e.g., Example 1), has for the first time reliably allows 

dissociated solid tumor cells obtained from a patient to form tumors. Importantly, this 
enables one to routinely analyze biochemical pathways in an individual patient's 
cancer cells and to do molecular manipulations that allow one to understand the 
cellular consequences of specific genetic pathways on tumor formation by de novo 
10 human solid tumor cancer cells. 



III. Solid Tumor Stem Cells Cancer Markers 

The present invention provides markers whose expression is specifically 
altered in solid tumor stem cells (e.g. up regulated or down regulated). Such markers 

15 find use in the diagnosis and characterization and alteration (e.g., therapeutic 
targeting) of various cancers (e.g. breast cancer). 

Example 4, provided below, describes methods used to identify solid tumor 
cancer markers. Preferred cancer markers are provided below in Tables 4-8, as well 
as Notch 4. While these tables provide gene names, it is noted that the present 

20 invention contemplates the use of both the nucleic acid sequences as well as the 

peptides encoded thereby, as well as fragments of the nucleic acid and peptides, in the 
therapeutic and diagnostic methods and compositions of the present invention. 



Table 4 

25 Up Regulated in UPTG versus UPNTG 

S100A8, KRTI8, CEACAM6. IF1TM2, HLA-C, S100P, S100A9. H2BFT, HLA-C, FXYD3, S10OA10, KRT19, TUBB, HLA- 
DPA1 , CEACAM5, LCN2, FTH1, RPS26, IFITM2, S100A7, CAP, HUMMHCW1A, HLA-DRB3, CD63, S100A6, HSPBI, 
HLA-B, MGLL, PTS, HLA-A, RAI3, DAF. UBC, HLA-A, KDELR3. SERF2, CTSB, CEACAM6, PDLIM 1 , SHC1 , GOLPH2, 

30 GABARAP, AQP3, COL3A1. AHCYL1, FXYD3, ITM2B, BF, RBMS1, DUSP1, PSAP, ARHGDIB, ENOI, ATP6V0E, 
MUC1, RARRESl, CD81 , TR1M44, ASS, CD59, PRG1. HLA-E, TXNIP, INHBA. CSTB, H2AFO, HLA-DRB4, RAB31, 
P4HB. LOC92689, B2M, CSNK2B, MGST3, DKFZp564ll922, C4B, UCP2, FN1, COL1A2. LOC51 186. LTF, TlMPl, NPC2, 
TSPAN-1 , GOL1A2, SLPI, CIB1 , IQGAP1 , SPARC, FNI,CCNI, SPTBNI. H2AFO, BTN3A3, FN I , SEPX I , GFPTi , 
ANXA11, CD74, RAB25, APP, PSEN1, IFI27, FHL2, CPB1, BACE2, PSMD8, LGALSI. PLAT, EIF3S4, ANXA2P2, PILB. 

35 IFI30, ATP6V0E, LOH1 1CR2A, LBP, HLA-DRB1, MIC2, OPN3, SVIL, FDFT1, PTGIS, ORMDL2, PIG7, ERBB3, GSN. 
FN1.GOT2, BCL6. WBSCR21, ANXA1, CLU. PIK3R3.TNFSF10.NBLi, PEX11B. CDKNIA, SAS, RIC-8, RABACl, 
ADD3, ARPC5, GUKI, NQOl. FER1L3, PPAP2A, TSPAN-3. PLOD2. TGM2, LOC51760, TST. TM9SFI, LGALS3BP, 
Cl4orfl, D2S448, OPTN, GPX1, MBC2, PTGES. DPYSL2, PEN-2, DAGI,GM2A, DKFZP564G2022, FAT. SLC21 AM, 
ACADVL, ABLIM1, H LA-DP Bl, COP A, PPP1R7, DAF, SSBP2, TES, MUCI6, PPL, MGC10765. SECTM1, C3, NNMT, 

40 ARF3, SEPWI, H1F2, SERPINB1, KIAA0746, RDGBB, ELF3.TUBB4, VCAMl. FOXOIA, EGFL6. ATP1AI, PLS3. LMNA, 
TGFBI, DD96, GLRX, PROSC, IL1R1, SERPINB2, KRT7. RGS16, TNFAIP1, SYNGR2, PAFAH1B3, GPI, C6orf37, ATF3. 
H LA-DMA, FU22418, DCN, FOX03A, HLA-DQB1, CPD, DF, HTAT1P2. MUC5B, CTSB, PBEF, HI 1, CAPNS1, Z39IG, 
MAGED2, TNFSF13, HLA-DRB3, H2BFQ, SGK. P4HA2, VPS28, NDUFB8, PON3, ENSA. EDF1 , SERPINB6. FDPS. RGS3, 
CREB3, PRNP, YWHAB, A2M. HLA-DQB1, PDGFRA, CLMN, INHBB, SURF!, NF1L3. S100A1 1, HPGD, CLDN7. DAB2, 

45 NT5C2. PLXNB2, GSTP1 , AP2B1, COL3A I . HRMTI L! , SRPR, RNASE6PL, ANXA8. PROML1 , C 1 S, GALNT6, BAT3, BC- 
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2, GLS, CD14, FYCOl, SQSTM1, CSPG2, DEFB1, BAT3, GALNT2, SPARC, WT1, DUSP6, MONDOA, MACFI, ATP2C1, 
THBS2, CD53, PGM3, HLA-DRB6, COL1 Al, SCAP2, KIAA0436, CYR61 , TNFSFI3. SLC6A14, CUGBP2, LAMP1, CCL22, 
CLU, CD163, ANXA3, MBLL39, IL4R, SERPINB1. CNP, TUBB4, FU20265, MAFB, EFEMPl, DPP7, SYNE-2, PLSCR1 , 
PDE4D1P, P2Y5, RAGA, SIAT1, N4WBP5, SPUVE, BPAGi, DEPP, BASPI, CTSB, HLA-E, KIAA0308, GAS1 , ABR, 
ABCAI, CRN, WDR1. PM5, CYFIP2, SGP28, FLRT2, ACACA, LUM, FU21432, FEM1C, RJN2, PCDH7, SLC7A7, 
FLJ21347. SOX9, MB, S100A8, DAP, MVP, SPPI, TM9SF1, DOC1, COL5A2, RNF24, GLBl, GRN, HLA-DRB5, ENPP2. 
CSGlcA-T, KIAA0937, H2BFT, JUP, KYNU, APOL6, GM2A, Clorf24, SYNGR3, CX)L6A1, CRYM. LXN, FARP1 , plOO, 
ANK1, NPC1, RBPMS, VLDLR, ARHC. UBE1, HDLBP, LYZ, DCN, PLAB, SERP1NE2, EGLN3, FSTL1, LAPTM5, 
TR1M29, ACTN4, MUCl, SH3GLB1, BIK. ZNF91, CLIC4, NARF, LIM, SLC1 Al, K1AA0746, APOCI, TYROBP, FLNB, 
EMP1, UBE2L6, KRT6B, MAN2AI, GCN5LI, APEH, F-LAN-1, PRKCZ, CDI63, HLA-DQA1 , KJAA1668, MUC5B, LAIR1, 
BCL2L13, CXX1, MPZL1, NR3C1, AHR, FU12389, ATP6V0C, MD-1, H2BFA, HSPC023, OSBPL8, ZNF36, TRJM14, 
UGTREL1 , CTSL, COL5AI, PDGFC, UBE2N, SF1, ARHGEF10, SH3GLB1, HLA-G, KIAA0084, HT0I2, SULF1, TTC1, 
UBAP1 , PGLS, M6PR, TEM7, NPR2L, GRN, EXT2, DCN, HLA-DMB, HLA-DQB 1 , NAGK, MMP19, LBP, ATP10B, CLN3, 
SP100. CSPG2, VIM, IGFBP3, ANK1, DUSP3. STAT3, CED-6, KJAA0I96, SOX9, NKX3-1, TGFBR2,CAV1, TREMI, 
PTD009, GPX2, LAPTM5. HSPC022, SSA1, ABS, CPD, DXS9928E, DUSP6, PGBD5, CNN3, PIP5KIB, FU13840, CLDN4, 
ABCA3, BPAGI, CAPZB, PPIB. ACTA2, CDH 1 1, FIJI 081 5, HLA-DPA1, FU20539, MUC4, CAV2, ACAA2, CEACAM1, 
GALNT10, MYO10, C9orf9, PAM, C6orf29, MGC:5244, RctSDR2, ATP2B4, DHCR7, GP, LOXL2, MIR, DCTD, BCKDK, 
RTP80I, KIF1B, ENTPD3, PAFAH1BI. LGMN, UBE2L3, PTPRH, RPS6KA2, ALDH1A2, FHL1, GALT, AP1M2, MAF, 
C4BPA, POLR2J, KIAA0790, TM4SF3, HPGD, THY1, NCALD, PADI2, KIAA0557, SM ARC A 1 , CD83, AZGP1, SMARCA1, 
MRPSI 1, RAGD, PIGB, FYN, TM7SF1, HLA-E, BRE, PLA2G4C, NOS1, 1D3, HLA-DQB 1, SSSCA I, PPP1 R14B, HLA- 
DPA 1 , ANK1 , PRKCH, CALU, PEF, DOK5, COL9A2. ATP2C 1 , DPH2L1 . MUC5B, LOC 11 3 1 46, NDN, PIG3, HLA-DRA, 
GPS2, CX3CL1, C1QB, TGFBR3, APOCI, BIN1, CBR3, TGIF, EFEMP2, SCDGF-B, TUBB-5, MAP4K4, CCL3, CCR1, 
RNF10, RGL, CDIC, FBLN1.GW1 12, ALTE, ALP, PLAC1 , ISG20, PTE1, NPD009, LOC55893, AP3B1, PRKAR2B, KRT9, 
COPZ2, LYN, FU21478, DKFZP566C243, NUM A I , ANAPC5, FU10134, ADPRTL1, ITGAM, PIP, FU22559, IFI16, 
TMPRSS4, HAIK1, PCSK7, ANK1, FCER1G, IMPA2, HLA-DQA1, 1FNAR2, NEOl, PRKCQ, SMARCD3, CECR1, 
FU1 1286, TBC1D1, MS4A6A, Clorfl6, LRRN1 , MRPL23, PUM1, SMA3, PDE4B, SLC22A4, MMP2, ICA1, SLC22A1L. 
RRP22, GBA, TMEM8, DUSP2, TREX1, SLC6A8, C3AR1, BSCL2, ARFGAP3, TRIM2, SERP1NB8. TNFRSF6, LDB1, 
CCND2, RGS2, MEIS1, HRJHFB2122, IF, P1P373C6, UPKIB, WDRIO, CG1-49, PSMB8, RARRES1, SLC16A1, DPYD. 
DNPEP, FU20254, COL5AI, FU1 1017, CCR5, MX2, PIAS1, CAPG, CDC42EP3, IL1RL1LG, SCGB2A1, RNH, INPP4B. 
B3GALT4, PLAU, DFNA5, KJAA0852, CR1P2, TIP-1 , ZNF142, HSD1 7B2, MYOI B, PCOLCE, FU22169, APOE, DAB2, 
CXCR4, NAG, SNCAIP, GBP1 , ASRGL1 , SLC6A8, REC8, SLC7A1 1 , CPE, MPZL1 , TD02, GALNT12, CDKN2A, 
KJAA1395, LGALS8, FLNC, NPR2L,GRB10, MGC15523, PTPRC, CAPN9, IFI16, NBL1, CRYL1, PSMC2, IGF1, BIN1, 
HNOEL-iso, DKFZp566O084, FGB, GPNMB, TLR5, FU20686, UROS,CX3CRl, HCAI 12, PRKCB1, BD1CRB2, CLTB, 
KIAA0652, KIAA1668, DCN, HLA-DQB 1 , C6orf9, CPR8, T1MP2, PSMB10, LTBP2, FLJ20452, HTATIP, LAMA4, GLUL, 
SH3BP2. HES2, KJAA1 115, KDR, PROCR, TNFSF10, FGFR1 , ELF4, F8A, BAG1, COL5A1 , THY1, H2BFG, TOSO, KRT15, 
AIFI, LY75, KRT17, CEACAM1. GAK. AGTR1. ASB8, K1AA0792, CDKN1C, C1R. PTGSl, TM4SF6, XT3, HLA-B, 
DKFZP434B044, ALDH1A3, NID2, U2AF1RS2, H2BFL, FUT3, PVALB, ITPR3, PODXL, QPRT, PTRF, PSMC4, ACATE2, 
MAP2K3, ATP2B4, CEACAM 1 , CALB2, TTR, TRIM38, JM5, FU2 1 ! 35, FU2322 1 , FU20452, GATA6, RABL4, KJAA 1 1 99, 
1GFBP7, MGC14376, CITED2, CASP4, MEIS2, PHLDA1, OXA1L, ILIRL1, FLII, EFEMP1, PYGL, LM04, GPR3, G1P3, 
APOE, ZNF193, AP1S2, PTGDS, TEM7, LOC51279, SLA, BTG1. INE2, WIT- 1, LBH, CXCLI, RAB31, POMZP3, COL6A3, 
EXTL3, MGC4309, LOCI 14990, KYNU, NAB1, CYP2J2, SMURF1, BRAF, HLA-DQAI, CAV1, KIAA0779, CHKL, SEC6, 
CGI I, FU20920, CGI-49, EIF3S10, P4HB, GYG, DYRK2, DKK1, MAF, TRIM 22, CENTA2, FU201 13, NR3CI, CYP1B1 , 
HSD1 1 B2, RRP46, FOLR1 , HHLA1, THY28, H3FB, FOS, GAA, FLJ 1 3 1 7 1 , RHOBTB3. ZNF32, HOXA5, CFLAR, PAX6, 
KIAA0076, CTSS, ALOX 1 5B, PCOLN3 , P3, AKRt B 1 . LOXL1 , H I F3, BIN 1 , GMDS, FU 1063 1 , SI AT4A, PIM 1 , LRMP, SL1, 
TFPT, RAGD, DSCRIL1, SETMAR, K1AA0657, GPRC5B, TIMM22, ARHGEF6, H2BFA, PPFIBP2, SALL2, FU21820, 
ABCD1, CPA3, SNX7, CUTL1, PALMD, ERCCI, MSTP9, PTPN3, GAL3ST-4, C6orf9, PTPRT, RGC32, AD-017, CRELD1. 
FU10097, RNASEI, S100A4, RORC, CMAR, USF2, FU13544, CASP3, SMUG1, RAF1, MYL9, GFR, PDGFRA, DPP4, 
ARL7, SLC3A2, RHD, FGL2, RBMS1, EGFR, PRO1580, FCGR3A, PTENP1, H4FH, MSCP, CSGlcA-T, ADAMTS5, 
TNFAIP6, PRKCDBP, PRKG1, CAPNI, OAS1, H2BFH, SCHIP1, FU21736, BMP1, IQGAP2, KRT5, LM02, HIC, PLAGL1 , 
AQP6, ZNF42, PHLDA1, YBX2, INPP1, CHST6. MGC4171, PL6, SPPL2B, EPHA2. CRYAB, MST1, ZNF21 1, MD-2, CRI1, 
KJAA0057. PACE4, LOC93349, RALGPS1A, LAMB3, HLX1, RIN3. SERPINB5, PLD1, DLC1, PIPOX, PTHR2, UBE2G2, 
CH13L2, K1AA1 111, TGFB2, PLAUR, ID1 , ALOX5, IGFl, REPS2, CDH2, BCHE. SNFT, FLM1286, MAPRE2, MAOA, 
SERPING1. PTGER3, KIAA0602, PGM 3, MATN2, DNASE1L1 , PGD, FZD2, PPAP2C, GOLGAI, ADAT1, TEX13B, MGP. 
FU20084, ART1 , EV12A. SART2, RFXANK. FBLN5, DPYSL3. ZNF1 87, RBMS1 , MLN, NRXN3, WASF3, DSC3, PPAP2A, 
EEF1A2, UBE2H, GABRQ. TFEB, MGC3123, GFPT2, WIG1 , FBLN1 , PTPRF, MEPE, SLC6A8, ILIB, GAC1. EPHX1, 
Cllorf9, OSF-2, FU101 11, SRPX, DAPKl, RBM10, MBD4, MECP2. ILVBL, KIAA0375, JAM3, PRSS25, KIAA0913, 
TNFRSF6. CSRP2, CCL4, C20orfl9. CA2, SLC7A8, BNC, PHEMX, ADAMTS1, XRCCl, PEMT, H2AFA, NEU1, OPTN, 
NRP1, TPM1 , W1SP3, GPX6, MRPL2, HP, BIKE, PLXN3, FACL5, MGC15419, FU1 1506, GLS. MAPK7, KIAA1053, CDH3, 
CST3, K1AA0752. RORI, TAP2, SBLF, AKAPI3, USP2I, PP35, ELOVLI, CYBA, KHSRP, MRC1, FU12057, H2AFN. 
MSN, TPM1, SLC16A3, ADD1, IL1RAPLI, SPTAN1, FU 10847, SNAI2, FU12986, GSPT2, FU10450, MAN I CI, MEF2A, 
VEGFC, RANBP3, MGC17330, SCD, F5, PIK3CD, SELPLG, LOX, VAX2, MSF, RANGAP1. BIKE, ARHGEF7, FU20300, 
MYLK, GMPR2. CENTD2, PPP1R9A, ANG, DNAJB2, IDH3G, ODAG, ADPRTL3, COG7, KJAA0429, NEDD4L, ALEX2, 
ATP6IP2, PTGES, MAN1B1, CYP3A43, AP3S2, DEFA6, PTGER3, FCGBP, CPSF1, NNMT, HAMP, CGI-38, BAZ2A, HLA- 
DRA, SPI 10, CA5B, UBEIL, BTN3A2, KIAA0842.TIA-2, PTGER4, PTGDS, MARCO, EPB41LI, IL13RA2, CXCL6, 
APOAI, NPAS2, ETV5, HFL3, EPB4IL3, CHI3L1, SSB1, EVI2B. K1AA1608, ME1S3, FU 1 3385, NQO 1 , BGN, MOX2, 
<JJ222E13.1, GMFG, TBC1 D2, SKIP, RABGGTA, MRPL28. FU21034, CRY2. SLC4A2, MGC20727. HAP1 , CYBB, GRIT, 
PTN, FUT2, CDSN, STAF6S(gamma), BENE, ENPP2, PAK4, CUBN, ICSBP1 , NPAS2, FU23516, FU23537, AADAC, 
MFAP2, ERCC4, STKI3, MCAM.GPR65, CYP17, FU20373, TNS, TRA1, NPY, PTPLA, PNLIPRPI, RBMS1, TM7SF2, 

MKL1.NCF2, AP4M1» IT0B4, SLC1 1 Al, PSCDBP, NPB2L3, ELAC2, CBFA2T1, SI00A12, PACE4, KIAA1395, HLA-G i 

MALTI.NDUFS8.ILI0RB.TCF3. HLALS, DKFZp76IKl423, DDX8, G0S2, SLC16A3, CCL18, ZDHHC4, FKBPIA, HRHI. 

G§A7, PTPRM, HBPI7, APPPPg, TNRG15, (Ml, PSME3, HFy, BGfcUi, SCARA3, APBQ1, kHFp, IGFl, PPGFRL, 

MUCt 3, tCtf!, NXF2, HRMtl AfcHD, »O\A0Si2, K1AA0977, FCNI, LAMPi, DNAJC6, ALBH3BI, TNK&, MAPK3, 
PLJ13491 , APOA1, RBP4, OAI3, CLTB, QM, MIDI, FOR, OUCI , PP1044, MAP.CHODU FUHI 73, TPD33L3, DD3, 
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PSIPl , HSPB7, EMP3, KRT6A, C5R1 , EN02, PF4, SYN1, PLSCR3, HMGCS2, BCAR3, LOC51693, ANGPTL2, TAHCCP1 , 
LOC51063, KIAA0561, GJB3, CPVL, PCBD, CG1-96, PKIA, NR3C1, GAS7, FBNI , MPV17, SLC21 A3, ARHGAP6, FMOl, 
CSPG2, FU22531, STX7, SCN1B, TETRAN. FGF23. CLECSF12, CDKN1C, HFI.GSTTl, V1LL, BLAME, RODl.TAPBP- 
R t HLA-G, HT01 7, CHP, SLC25A10, LST1 , FIJI 1 196, VAMP2, NR0B2, CSNK2A1, SLIT3, MAPK7, CXCL2, GYG2, PGS1 . 
5 CDYL, VNN2, CLN5, NPAS2, MLL. TRPM4, LYPLA3. MY07A, PSMB1. PAFAH2, PITXl, GRB10, TIMELESS, 

APOBEC3G, KIAA0819, GALNTIO, PTPRO. NMB, FU12298, RAMPI , OR2F1, HPGD, CALBI, CCR7, ICIAAI614, 
SLC2A3, OLFM1. DKFZP564G202, FEZ1, AKRIC3, ACADS, CALBI, PIK4CB, FOXA2, FU2058I , RRAS, BHLHB3, 
HUNK, MLLT3, RBMS2, KIAAO620, SLC29A2, SIRT5, SLC27A2, FU21458, DTR, ACTNI, KIAA0429, SLC2IA9, 
FU10211, LOC63920. FU12377, ARPC4, TSSC4, MEF2D, RPL10, NOV, CG1-72, FAIM2, TBX2, GABRD, Clorf24, 
10 MGC2615, NR1H3, FU 14675, AQP5. ZNFN1 A3, SSPN. SIGLEC7. COL5A2, H LA-DOB, SLC12A3, Apg4B, HERC3, 
HEM1, EBI2, ZNF323, FU20950, FASTK. C6orf32, LILRB2, SPP2, DHPS, UBE2B, MET, ST14, EGR3. SIGLEC5, 
SAMHD1, PGCP. PTPNSI, SPARCLI, FU22160, RANBP2, IL15RA, OXT, FU21 168, PTPN14, BA1AP3, TPM4, NCR3, 
TEK, H2BFE, SLC34A2, SLC26A2, KIAA0870. MET, SENP3, PTGER4, CGI-48, PDGFB, CD86, GTF2H4, KIAA0053, 
PTX3. BIMLEC, CAMK4, PROS1, AOX1, K1AA0931, COL4A1. USF2, PLINP-1, TM6SF1, PTPRG, SNX17, SLC5A4, 
15 MSTP032, PCTP, PQBP1, CDV-1, AD037, RNASE6, SNAll, KIAA0872, MEF2C, ZNF3. LOCI 57542, FCER1A. PRB1. 

SIRT3, DKFZP434K046, ABCC6, NPC1LI , BCL2A1 , LOC64167, GS3955, UP, CLECSF6, MGC20727, CHN2, CD3D, BAD, 
KJAA0435, PECAM1 , IGSF4, BCAS3. C8A, ZNF131. MGC10771, SEC14L1. SERPINHI, IL1F6, KLK1 1. THBD, FKSG28, 
KIAA0173, HKE2, PFTK1, FU1 1560, APOL1, CHRM4, ALLC. MS4A4A, SLC1 Al, BBP, ILT1 1, SAMSNl, IGF2R, 
FU20421 , PBX2, MAPI LC3B, 37872.00, NCK1 , FGFR2, CD86, FU23506, SCD, FCGR2B, CYP4A1 1 , S100A2, AP2S1 , 
20 PLAGL1, PTG1S, PCOLCE2, SLC2A3, DKFZP761N09I21, GPR105. OSBPL3, RPLP2, DKFZP586I2223, CD36, BBOX1. 

VNN3, AKR1B10, ZFHX1 B, DKFZp434H22l5, RoXaN, RSN, GALNS, PROSC, PCDHA3, PLXNA2, CCR8, BACH1, NPAT, 
SPAG6, DGCR13, CAPN5, OSBPL3. CYP-M, FU13902, FU13659, ADAMTS3, IL1RAP, ELF1, HYAL1 , WNT2, CCS, 
TREM2, K1AA1036, FU20574, FU132I5, CUGBP2, FU20010, GABRE, RCE1, SCIN, HLALS, MGCI0940, ADARB1 , 
PLA2G7, KIAA1237, KIAA0889, FU22593, CD244, NEK9, TAT, RAP1GDS1, SMA5, MYH1 1 , APAA. MERTK, GJA4, 
25 TNFRSF1B, MRPS12, HSF1. COL1 1 A2, DAB2, PCQAP. WDR4. ABCA8, CLPS, ARHN, PHF3, AKAP12, LST1, 

MGC12904, FU1 1539, ZFP36L2, SERPINF1, MGAM, PRG4, RAB5EP, CASP2, DIPA, AQP3, VAMP5, DXS1283E, 
COL4A2, MMP10, CD97. MGAT3, FCN2, KIAA0475, FGF9, CTSZ, SQV7L, H326, PLD3, TRPC1, OR7E24P, GRIA2. KIP2, 
BARX2, MHC2TA, RECQL. NUP214, DHRS2, P2RYI, KIAAU55, HLA-DRB4, CAPN6. TLR7, AHCYL1 , TRGC2, NEB, 
POU2F1. CPSF1 , APOB48R, CLDN9, FU21276, AEBP1, MN1, PKD2, PACRG, CALM 1 , TSPAN-3, KIAA0233, ATP6V0E, 
30 TRIM34, DKFZP564JI02, CNOT8, STC1, NFE2, FCN3, CKIP-1, PLA2G4A, TRGC2, DES, CDC42EP2, HSD3B1, CSN10, 
PRKACB. RDH5, CDW52, XYLT2. HPN, WIZ, GOLGA2, CSHL1, GLRX, PCDHB1 1. TNFSF18, fCLRDl, 384D8-2, 
WHSC1, TNFRSF10C, EVPL, TNFRSF5, S1AH2. GYPB, PMMI, DPYSL3, FU14297, ZNF42, BSN, OMG, AXL, ACK1, 
PKD2, KIAA071 1, FU00060, GUCA1A, PAPPA, CBLN1, FRCP1, BTD, FU20591, FGG, CXCL14, NPR1, CAMK2G, 
HLCS, SECP43, BCAT1, MSR1 , IGFBP4, Cl3orfl , PR02577, KIR2DL4, BAALC, FU21919, CNTF, LOC51295, ENTPD1, 
35 TAPBP-R, CAP350, PKD2L1, EVX1, NR1H2, FU13868, ERCC3, DKFZp434L0850, NR3C1, DMD, BST1, CARD15, SKD3, 
CASP1 , PCDHA6, NR4A1 , HAS2, COPEB. R29124 I, THPO, AQP6. MGC 10848, RAB6B, ABP1, APOB. UTRN, MICA, 
SSTR4, FU23056, C6orfJ2, ROM 1 , FU90005, KCNN4, MGA, HSPC219, CGEF2, CDC42BPB, CCR4, GLS, MAGE-EI , 
PILR(ALPHA), PGK2. KIAA0657, SF3A2, NOTCH4, CLECSF2. FBLN2, B4GALT1 , WNT2B, NRBP, LTB, FU2202 1 , 
CDH6, TUBGCP2, GCN 1 L I , ZIC4, HR44, AGA, SLAT9, EM PI , EPOR, IGKC. TAHCCP l.PECR, FLJ2I477, EDG 1 , MS4A2, 
40 BCAS4, FU22404, DPYS, PRCC, POLD4, BIKE, GAS7, KIAAI0OO, ZFP, WNT7B, MUC4, FU10477, CD1 D, MGC4614, 

CCR1 , NEU3, SIX3. FIJI 0640, GPR51 , STOM, SERPINE1, HLA-DQB 1 , PTN, DNCLI2, EN2, FU20378, IFP38, LOC90326, 
IGU3, NCYM, KIAA1 107, GP2, PLAUR; CD47, BFN1, MGC14799. IGFBP1. SSXI. IDUA, RECK, CD6, IGHM, ADD2, 
AKAP2, HSF4, MDS032, FU20086, TNXB, IGFBP3, KLKB1, PRB4, KCNF1, PDE9A, SIPAI, SMARCB1. COL4A6, 
PDE10A, NFATC1, CDH16, COL6A1. ZNF272, LDB2, HCRTR2, B1 , ATP 12 A, FU1 1710, LOCI 161 50, KIAAI049, 
45 HSPC157, FU20701, IGSF6, TOMM22, TGFB1 , PTGER2, CHML. FA AH, COL6A1 , DGUOK, LRRN3, B7, K1AA0876, 

Clor«2, CYP2A13, CXCL5, CD5L. FBXL6. GALNT2, GJA10, COL15A1, TEX13A, 7h3, TRD@, RIL, OTC, SAST. KLF8, 
TUBA8, MGC45806, FLU 3479, GRP. LRP4, CD84, WBSCR14, EPOR. BRAP, ziziminl , DNAJC4, FU20356, SERPINA2, 
FU10432, CD209L, NRP1 , PGDS, PLA2G2A, TNFRSF4, PR02214, DNAJB6, RDHL, FOSL2, DEPP, FU20241 , MMP1 1 , 
HLA-DQB1, RBM10, 8D6A, MAX, CUGBP2, CKTSF1B1, ISL1, CREBBP, ACTA1, NUDT2, ORIA2, GPR86, SH3BP2, 
50 APAF1 , PR01386, 1GL@, EVI5, K1AA0443, MFNG, XCL1 , ITM2A, IGU3. SIN3B, CCL18, NRXN3, AQP7, HLF, SEC14L1 , 
DNM 1 , KIAA055 1 . STK17B, GNS, IL10, MGC20727, COL5A1 , SEMA3B, Ct 1ORF30, CASPIO, ORM2. NPEPPS, CALCRL, 
ALK, SH3BGRL3, FOXD1 , MNDA, LCP2, ANK1 , GSTA1 , FU20856, ALOX15, L1CAM, DRFI , TM4SF9, SLC24A1 , 
NR4A1 , ATP7A, PCLO, TSHR, CAMK1G, MSR1, GL1PR1, K1AA1069, LYN, FLJ00001, MIG2, DLGAP2, TF, SOD2, 
ELMO!,BMP2,SLC12A5, PSG 1 1 , EPB4 1 L3, CAMK2B, TGM4, SCN1 1A,CALU, Fl 1,GPR75, KIAAI053, S1X1 , WBSCR5, 
5 5 RIN3, CCNT2, CABIN1 , NR2C2, TRPM 1 , ABCD2, VDU1 , FU208 1 1 , GJB3, ASAHL, RAB 1 A, HAND1 , BA12, EDG8, 

TNFSF 1 3, HPIP, PTPRN2, PRO061 8, PRKCI, PSTPIP1 , FACL4, ETV4, CACNA 1 D, WISP I . PRLR, FEZ2, CCL25, PCNX. 
SNX10, LILRA2, KIAA1086, MKRN3, PRG1, HGC6.1.I, GUCA1B, RIG, FLT1, HLA-C, KIAA0427, LILRB2, MAP2K5, 
FIJI 1 125, EFNA5. DUOX1, LIG4, MRE1 1 A. DEFB126, DNAJC9, RQCD1, ABCB8, HPR, MRS3/4, KPI2, NR1 13, FBXW7, 
HS3ST3B1, LAD1, SHMT1. C1TED2, DNALI1, POLYDOM, PFKFB4, KIAA1029. UTY, SCAND2, ZNF215, FOSL1, 
60 CDH17, PCSK5, ACE2, ERG, FIJI 1619, KIAA1466, K1AA0675, IL18, FU2I562, BTN3A3, FACL6, FANCA, ANKRD6, 

CALCR, CSF1 , FU 13262, CALR, TFEC, SSTR2, HBD, MGC10986, GTF3C2, HRC. RHOK, KIAA1 117, K1AA0924, ITGB1 , 
DEFCAP, FU 12525, TBXA2R, GLIPR1 , AVPR2, CCNE2, TBXAS1, RGS5, HAGE, FOX03A, SYK, 384D8-2, ABO, 
24432.00, MASS1. PF4VI, CASP5, CNGA1. FU 1 425 1 , SLC9A3, UPK3B, DLG1, COL17A1, PCDHB12, OSIL, HFE, 
KIAA0495, KCNJ15, KJAA0997, RGS11, PITX3, FU13055, UBE2I, PR02176, CACNB4, FOXH1, RASA2, PML, BCAT1, 
65 EDG2, OCRL, ATPAF2, PMS2, POU2F3, PTPN21, SUPT6H, HANU, RORI, COPEB, K1AA1654. DKFZP434B204, TNIP3, 
EPAG, CACNB2, NEK2, XRCC4, IL6ST, TNRC1 1 , CAPN1 1 . 37870.00. PLA2G4B, NPEPL1, RASGRPI , HABP4, CYLD, 
C15orf5, ITGB3, FU23093. NPPC, MCOLN1 , GAD2, TRO, LOC5 1 063. OGN, NR1 H4, MTRR, SS-56, NT5E, C22orf4, 
SLC4A5, SGCG. C8orf1 , LGALS2, ELK1 , TRPM8, MGC2655, NR3C2, PPARG, MXD3, SERPINB3, PRO046I , GNAI1 , 
AVPR2, PEG10, SPINK 1, CLDN1, STC1, KIAA1045, F2, GNG1 1 , FY, H4F2, D21S2056E. CAPZB, K1AA0599, ClorT29, 
70 RGS 12. GCG, NCOA2, FOXL2, UGT1 A8, PKLR. NRG 1 , 1TGA7, CNOT3, SPRY2. PIK3R1 , ZF, PTPRR, KSR, TCEB3L, 
IREB2, PRO0899, PAWR, SOX18, Gene Symbol, RPL28, FU13352, C20orfl 14, PIGR, ERAP140, MY05B, EGR1, 
LOC124220, TCEB2, BACE2, NMES1. KIAA1324, MGC45416, WASF2, APOA1BP, FU321 15, ATP6V0E, TIMP2, H2AFJ, 
C9orf5, RASDl, KIAA1437, H2AFJ, RDH-E2, DKFZp434G17l, GUK1, FU20671 , CAPNS1, KIAA1671, H19. FU23I53. 
NDUFB10, FU13593, GLTP. TLP19, ENPP5, MGC39329, MRPL41, ARF3, LOC51255, HSPCA, BR13, FU14525, 
75 LOCI 13246, RAP2B, FU141 17, GLCCI1, PPP3CA, PHP14, MIR. ADCY4, FU1 1320, MSTP028, Cab45, TNFSF13B, 
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ZNFN2A1, MGC14327, KJAA1404, RAB34, RBMS1, ARHU, SPUVE, LOC54516, SAMHD1, LOCI70394, SAMHD), PIGR, 
CYP4X1, NFIA, KJAA1715, CTHRCl, DKFZp547A023. KIAA1434, MYBBP1A, MGC4248, H4F2, H4FH, NPDO07, 
MGC14839, FU2I791, HDLBP, C8orfl 3, FU23393, FU1 1046, DKFZp434C0328, BCATI, BAT5, FU3I23S, LOXL4, RNF7, 
MGC2803, CLDN! , KIAA2002, STMN3, MY05B, CTSS, ATP1B1 . MGC4309, UBE2H, DKFZp762Hl 85, LOCI 15265, 
5 MGC13045, SH3KBP1, MGC4604, TRIM47,C9orf5, SDCBP2, AP1S2, C20orfl 10, LOC51234, SAT, dJ55C23.6, CKXFSF7, 
PCDHA10, MGC1 1 U5, MGC15397, LOCI 16238, TR1M8, FU25157, NAV1, KIAA 1870, ALS2CR9, GCNT1 , GALNT4, 
HSCARG, PPP1R1B, PHP14, TGFBR3, ARIH2, MGC1842, SELM, AKAP2, MAFB, FU23091, MBNL, TEM8, CFL2, 
KIAA1S54, SEMA4B, FU 10961, SCAP2, KIAA1244, RIG-l, TRABID, TR1M56, MK-STYX, ™EM9, FAD104, GLTSCR2, 
MGC:13379, MGC40555. FU14251, NOL6. FU23499, DHRSX, DKFZP564D1 66, CED-6, LOC57168, K1AAI337, CRB3, 
1 U EMILIN-2. GJB2, ECGF1 , CHDH, LOC120224, ZNF75A, EPST11, NESHBP, FU10210, FBX025, MS4A6A, NOTCH2. 
FU39885, FOXP1, ORMDL2, MGCl 1 134, MS4A6A, HSPC195, KJAA1913, UACA,Clorn3, USP28. LCMRl, GBA2. 
DKFZp547D065, TH1L, RORC, PAK1, MGC2555, KJAA0146, FU20186, SCAMP2, NGEF, C14orf58. CED-6, LOC55893, 
GTAR, MGC24103, MS4A6A, DAGl , KIAA1394, FU20073, MGC131 14, FBX032. CD44, CTL2, ARNT, C21orf63, CLIC6, 
C20orf64, FU90586. RBPMS, LOC51242, MGC45441, CLMN, FU35564, MGC4604, DRCTNNB1 A, CGI-125, 
1 5 DKFZp547A023, MGC39325, CD109, FU23499, EHD3, MGC4840, USP21, DKFZP761E1824, FU22215, 1L17D, 
MGC16028, MS4A7, GALNT2, CDKN2B, LOC905S0, CKLFSF3. FS, KIAA1949, MRPLIO, MGC457I4, MAP4K1 , 
SLC4A1 1, HPS3, DNAJC5, LOC120224. FU1 1036, KIAA1337, FU10697, SENP2, SART1, MGC2474, SCD, FU14486 
KIAA1214, CARD6, KJ A A 1 69 1 , MLL5, C20orfl02, FBXW5, RARA. SLC13A3, FU33817, NRP2, BACE, LOC55971, 
FU14855, LOC133957. GPRI08, MRPL41, MGC10485. CMG2, C8orf2. PIAS3, DKFZp434Gl 18, K1AA1500, APXL2. 

ZU MGC16028, COG1 , UBE2H, CMG2, CTSB, LOC143903, CANX, PAG, CP, FU40432, LOC137392, DKFZPS86F1S24, 

SAMHD1, DKFZp761A052, HSPC002, C20orf23, DKJFZp434N061, SLB, PSMB7, MGC4342, DKFZP434P106, FU22678 
SYTL4, DKFZP566J2046, LOC51249, PARVA, FU23091, YR-29, LOC55893, OGN, CPNE2, KIAA1784, Spir-2, DNAJA4 
TMOD4, FU30726, C9orfl9, SNX8, DUSP16, FU34633, FU25785. OSAP, B2M, DERMOl, ZNFN1A4, SCYL1. C16orf44, 
MAF1. MGC12435, MSCP, JAK3, PPP1R16A, MGC4607, G6PT1, MGC16212. FU22283, SRA1, HBP1.CTL2, HCC-4 

25 SPTB, C6or07, KJAA1337, SNCAIP, SMOC2, PYG02, FU12770, FU40432, BMF. SLC27A4, Clorfl9, SLC5A1, CHRM1, 
FU14457, DKFZp434F054, SES2, MGC45474, BTC, APOA5, DKFZP434P106, KIAA1522, ZNF3I7, al/3GTP, PCDHB3. 
MGC26963, HSPC182, SNX9, NFAT5, C4orf7, NCAG1, K1AAI363, TAF6L. NAV1, KIAA1361, ZDHHC9, MGC2615, 
PHLDA 1 , AD-003, LOC90268. FU10101, PCDHB16, SLC2A12, CKLFSF2, FU23518, SEMA6D, PSID, SLC31A1. 
MGC10485, SLC5A2, ARHGAP9, NKD2, ETSl, FU90586, REN, FU14981, DKFZp76lH0421. DKFZp434F2322, MUM2, 

30 SPP2, MGC4734, FU13687, BANK, CNTN3, TLR8, HM13, FU36525, SLC12A6, DAPP1, VANGL1, MSH5, P5CR2. 

HAVCR2, CXCL14, GALNT5, ANKH, MGC29463, FU00028, TMPRSS6, AMOTL1, ODF3, MGC4604, ARG2, FU10052, 
FU13881 , PP2135, SLC12A4, MGC10500, MAP1B, DKFZp547l094, FU30473, FU12886, ST6GALNAC6, ESDN, SEC15B, 
FU33903, LATS2, ZNFN1 Al, SLC16A10, DSCR1L2, PSMB5. GPR34, FU20557, CGI-85, HCA127, DKFZp434Il930, 
FU908 1 1 , LOC 1 1 3026, FBXO 1 8, MGC872 1 , BLVRA, MGC 1 0974, PROl 635, MAP4K1 . HKE2, FU32 1 22, FU35867, 

35 FU10392, WFDC3. C21orf6, FU23654, DKFZP586D0824, C2 1 orf9 1 , ENTPD2, RGNEF, GPRC5C, RALBPl, FU31052, 
CI 1ORF30, FU30803, ITG A 1 1 , KJ AA 1 053, AGTRAP, NDUFS2, FU32069, ACTR1 A, SLC2A4RG, PPARBP, FU10055, 
C20orfl67, FU 12649. KJAA1909, IF1T2, EMR2, CDS, HT036, SERPINB9, MAP1LC3A, IGKC, ZD52F10. FU32028, 
BTEB1, FU20539, CCL28. MGC21621, KJAA1 130. KIAA 1 554, FU31937, RPL29, GSA7, FU25067. FU20989, LOC92689 
FU12604, MS4A6A, ELA1, SMOC1, CIQG, MGC 1 442 1, KIAA 1576, FU20245, LOC155066, PRDM6, DAP10, PCDHB14, 

40 FU25124, SNRK, ADAM TS 16, SES2, SECP43, EPSTI1, KIAA1948, NOL6, PALMD, PAG, MGC39807, TTY7, NUDE I 
KJAAI210, HRB2, USP21, C9orfl9, LOC93589, DKFZp434E1822, MGC 10561, RN02, GLCCU, MGC3234. AMOTL1, 
FU33868, B3GNT5, FAM1 1 A, SBBI31. FU23654, SLT, CPM, DKFZp762K222, NSE1. KIAA1817, NYD-SP21, LUC7L, 
FU13063, SIAT6, CASP14, FU1 1896, GPR92, FU25027, EVC, HOXA3. HTGN29, MGC4281, MGC15548, GSN, AD023, 
FU 1 43 1 1 , TAG AP, KJ A A 1 276, CGN, ZDHHC 1 2, FU2 1 736, FGFR2, LOC9 1 46 1 , GNG2, BACH 1 , KIAA 1 92 1 , KIAA 1 957, 

45 FU101 1 1, KIAA1 145, ARHGEF7, STARD4, retSDR3, HBXAP, ARFGAP1, NY-REN-60, RIG-1, X102, AF1Q, SYTL4 

ICAP-1 A, K1AA0872, LOC148932, SCML1, NOL6, Hcs4, LOC57038, TRPM6, ABCC13, CGI-85, DRLM. BCAR1, NR0B1 , 
MCOLN2, KIAA1 836, MGC35048, VIL! , LOCI24245. MRP63, TTYH2, FU14735. PRIC285, KIAA 1 999, GALNT7, EGR4 
DKFZp434F2322, PHACS, LOC51219. IX>C132158, PRO0971, SUM, SKD3, RNF26, TTTY6, TNRC18, CTXL, FU 12666, 
FU39957, FACL5, POLK. SLC25A13, FU3t318, ZFP91 , MGC19825, TPM2, PPPIR14C. LOC142820, ALDOA, EGFR-RS, 

5U FBX027. PRO0038, MGC 10992, NPCR, HCMCXiT-l, RSP3, PPP1R9A, KCNMB3, GPR55, ZFP28, PROl 635, C20orfl 54, 
FU32203. MS4A6A, KIAA1647. KIAA1607. BAZ2B. FU32752, ZNF216, PP2135. K1AA1357, MGC16207, K1AAI694, 
GBP1, FU10474, FU10826, ELAVL3, LOC90668, CPXM, MGC2452, FU20273. M1C2L1. FAD104. GPR107. MGC15419, 
SORCS2, ST6GalNAcl, RP4-622L5, DKFZP434F01 1, TNKS2, DKFZp761K2222, Ellsl, SLC4A1 1. KIAA1 163, CALN1 , 
K1AA1828, MEGF10, GR1N3A. REV1L. BHLHB5. ADMP, DKFZp667I133. MGC13275, KIAA1889, DKFZP434A236, 

55 GPS2, FU20309, NAV1. MGC2603, ARHU, FU33071, NUMBL, CDGAP, FU35713. DKFZp761 A132, FU10300. 

FU12634, GTF3A, NEOl . RRAD. MGC10966. PTPN2. FU10292. ACPP, CISH, DOT1 L, POLRMT, CGI-149, KIAA 1 202. 
DKFZp761 J139, MGC40178,GATA4, EVIN2, MS4A8B, FU 10057, NDUFV3, SF3bl0, RP2, FU21032, CLG, MGC3040, 
ODZ2. AQP1 . DKFZp566F0947. CCL27. TARD9, MGC40222. DKFZp564C236, SDS-RS 1 , SNCAIP, ENDOGLYX1 , CGI-30, 
FU10314, MGC20470, KLHL6, K1AA0212. PRO0899, KIAA1894. FLN29, FU20373, GTF2I, GJC1, BHLHB3, CPNE5. 

OU GPC6, IL6R, RRN3, DKFZP564J047, C20orf99, CED-6.DKFZP434P1 735, TG1F2LY. LOC83690, GPRl 1 0, FU34922, 

FU2021 1.FREQ, USP26, MGC15634, ZSIG1 1, ZFHX2, C7, UNKL, LOC151835, MGC21854, FU25410, EGLN2, KIF9, 
KIAA1550, CIP1, DNAJC9, FU14768, MGC2599, LOC57018, DDX12, MGC33993,SLC22A3. KIAA1399, DKFZP434F091, 
EG1, SE70-2, DKFZP5641 1171, CDH26, TRPC7. DKFZP566K1924. C20orf60, ROR2, KLHL5, SCARA3, PRO1580, 
MGC 15523, DKFZp434C0328, FU3I528, CR1L, FU32734, NXF3. MGC4I906, CLECS F9, SSBP4. ZNFN1 A4, FBX022, 

65 NCAG1, MAP2, KIAA 1529, TIGD5. SNX9. FU32001, RPC5, AK2, KIAA 1887, ACK1. FU37312, ARSD, FU3I564, 

LOC51 136, MYEOV, GNAI1, MGCI2335. FU20356, KIAA1617, HNT, C21orf59, LOC221468, ENAM, PB1, TBXASl, 
NMNAT, MGC10204, TNKS1BP1. LOC57401, FU32194, ENTH, APOA1, 1TGA6, MGC12458, FU23403. BCL10, H19, 
C7orQ, DNER, PDEI 1 A, MAF. FU10378, MGC14276, TLEI , SH3GLB2. TTTY8, KCNH3, LOC90693. ENDOGLYX1 
LOC 144402, CGI- 105. LOC 153222. ASAH2, MGC4415, KIAA 1495. SFRS12. and AGPAT3. 
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Table 5 

Up Regulated in UPTG versus HSC 

CFLl , S100A8, SERPINA3, UBC, MUC1, SFN, ANXA2, ANXA2. COX7A2, HSPA1 A t KRT18, ANXA2, OAZ1 . TMSB10, 
CA12, DNCLI , CEACAM6, ASAH1 . RAC1, ARF4, TACSTD2, MYL6, MSF, JTB, CKAP4, TFF1, 1ER3, GATA3, IF1TM2, 
5 SFN, MTCH1, TPMl, CD24, NET-6, MLC-B, MLPH, QP-C, SCGB2A2, S100P, SI00A9, COX6A1, CAPN2, COX5B, CD24, 
H2BFT, XBP1, FXYD3, RNP24, PTS, GSPTl, COX6C, TIP- 1, HIGl, RPS16, SAT, HSPCA, TPD52L1 , TMSB4X, SIOOAIO, 
JTB, RBPMS, KRT19, FU10830, TUBS, JTB. ITOB I . CEAC A M5, MT2A, LIV-I. HN1L, LCN2, LOC51U2, LOALS3. 
RAB13, FTH1, TCTEL1, IFITM2, SI00A7, PSMB4, MAG EDI, FU20151, DB1, COX6B, C20orf24, ARHA, NFIB, PTP4A2, 
NDUFB2, CALM 1 , ATP1 B 1 , GNG5, CD63, NAT1, S100A6, E1F4B, ESR1, HSPB1, TAGLN2, ALCAM, NDUFS6. AGR2, 

10 C8FW, TXN, HDLBP, NDUFA4, PP1C, GLOl, RABUA, LPP, HDGF, CALM 1 , MGLL, PTS, ARF1, DC 1 2, SNRPD2, C4A, 
RAI3, NDUFA6, ATP6V1D, MLCB, TEGT, DSP, PNN, ACTN1 , NIFIE14, NDUFB4, DAF, VAV3, UBC, SSR2, MKNIC2, 
HSPC014, KDELR3, TACSTD1 , DKFZP564A2416, ASAH!, DDR1, ENAH, KJDELR2, DNCI2, PPP1R1 1, PP, SERF2, CTSB, 
SSR4, GNAS, PGM1,CEACAM6, PDL1M1, GATA3, MGC3178, SHCl, GOLPH2, GNAS, VAMP3, S100A14, GABARAP, 
ALDOA, TAXI BP 1 , LASP1, NFIB, CCT3, AQP3, DBI, VCL, GNAS, ALDOA, COL3A1 , ATP5J2, MGC 16723, USP9X, 

15 TMEM4, MTX1, HSPC134, ZMPSTE24, UQCR, AHCYLl, GOCAP1, HT01 1, EDF1, CRIP1, FXYD3, MRPL9, R1P60, 
TIMM17A, BF, RER1, DC50, CTBP2, HEBP2, YIF1P, LOC54499, APMCF1, UGDH, PSAP, SPEC1, FU12619, TUFT1, 
COX5B, LRPIO, ATP6V0E, CYP27A1, PON2, NQOI, PTPRK, EIF4EL3, GNAS, CLTA, MDH2, TCEB2UBE3A, TM9SF2, 
MUC1 , RARRES1 , PRDX4, MIF, TPD52, CD81 , DSTN, HRY, HSPC051 , SMBP, HDGF, C I4orf2, BRD3, NHP2L1 , PPP2CB, 
DLG5, ASS, ENS A, MAGED1. CD59, SHAPY, CAST, JDP1, HK1FBX09, RPL38, INHBA, EMS1, HR1, APP, HAX1, 

20 FKBP1 1, GOLGB1, SPINT2, GORASP2, CD24, HSPA1B, FU 13593, MGC5466, E2F4, PRO 1855, UBE2V1, KJAA0882, 
RPL36AL, CSTB, ATP5I, OASIS, DKFZP564K0822, RCP, MAGED1 , PSMB5. NDUFS2, YWHAZ, KIAA0310, RPL38, 
FU20273, RAB3-GAP150, PSMA5, ATP2A2, C20orf97, TUBB2, RAB31, C9orf7, HIGl, INSR, TPMl, GSPTl, PSME2, 
CSNK1 A1, P4HB, E1F2S1, LOC92689, NDUFA3, KJF5B, PAM, MTIH, SHAPY, FU10898, GUSB, BNIP3, KJAA0992, 
FLOT1 , PSMB7, TAFI0, CSNK2B, EPRS. PIG7, DAP3, ECHS1 , AP3D1 , COX8, PMP22, LOC54499, ALDH3B2, MGST3, 

25 PRDX2, PTD01 1 . COX5B, CAST, LASS2, PSMB2, MT1X, MYD88, DKFZp56411922, FU20719, C4B, H2AFL, FLOT1, 
PIN4. TCEB1 , WFDC2, SQRDL, CSTA, PTD009, PTPRF, DAD1 , PDEF, FN1, GPX4, DDR1, ARHD, COL1 A2, PDEF, 
HSPC009, MEA, ABCD3, CYB5, MLCB, PR01489, PDEF. RPS11, IDHI, SLC12A7, H2BFB, SH3BP4, CD24, SLC38A1, 
RAB3 1 , LTF, TIMP1 , SH3YL1 , SEMA3F, TSPAN-I . KIAA0852, NDUFA8, COL1 A2, SLPI, PSMD4, RPL27A. GNAS, 
KJAA0876, DPI, CEBPD, CIBI, IQGAPI, TSGI0I, MGC3077, CYB5, FN1. LOC51 128, EMP2, CETN2, PACS1N2, PBEF, 

30 MRPL24, CTSB, SDFR1 , MLP, TM4SF1, C20orf3, PRKAR2A, MGC5178, FN1, FU20054, MMP24, SEPX1, GFPT1 , 
ANXAI 1 , ADFP, GMFB, AP3S2, PTBP1 , BAG1 , FU10496, CYB5, CXADR. RAB25, FH, APP. CDR2, PSEN1 , RFP, 
SEC22LI , GGPSl , ARMET, USP7, FU20847, EFA6R, HSPA4, RDBP, TNFSF10, DDR1 , K1AA0429, PLP2, RABGGTB, 
BAG3, IFI27, GATA3, LAMP2, CD24, MRPS14, FHL2, CGI-130, CPB1, SCAMP3, NESCA, BACE2, PSMD8, LGALS1 , 
MPHOSPH6, FU14154, COPZ1 , CALR, HK2, WIRE, PTP4A1, TRA1, DKFZP564G2022, CTSH, CRAT, PLAT, ANXA2P2, 

35 YME1L1 , PILB, ITGB5, KJAA1026, FKJBP4, TBL2, PIGT, WSB2, IF130, TUBB2, E2IG5, YMEI LI , ATP6V0E, RAB4A, 

LOHl 1CR2A, PLU-1, KIAA0483, SLC2A1. LBP, MGC1 1256, FMOD. TLE1, POLR2H, TOB1, NSF. TACC2, OPN3, USP3, 
PSMB1, TMP21, DUSP4, RAB2, SVIL, FDFTl, NFE2L1, PTGIS, RPP20. PGLS, ORMDL2, NR2F6, PIG7, ERBB3, TRAP1, 
DDR1, SDC4, HSA243666, PLU-1, ATP6V1 El . DAAM 1 . GSN, MCP. KIAA0143, PI 7.3, PIN4, WARS, FN I, TFG, COPB2, 
ERP70, MRPS18A, C22orf5, LYSAL1, POLR2I, SARI , ATP6V0B, TUFM, NDUFB?, BCL6, PDCD6IP, TR1M33, UBE2N, 

40 WBSCR21, NEDD5, LOC51 123, GMFB, PFN2, KRTHB 1 , NANS, CLU, TOMM20-PENDING, NDUFS8, MT1G, ANK3, 
PIK3R3, IL13RA1, TNFSF10, DNPEP, TNRC9, NIPSNAP1, BRP44L, PEX1 IB, FU13612, FU22028. POLB, ANXA4, 
SEC61G, PREI3, CDKN1A, MT1L, SAS, PSMD5, COBL, CARD10, UBE2D3, RABAC1, CPD, C21orf97, PAM, MRPS10, 
CGM09, GBP2, TC10, NMA, FASTJC, P4HA1 , GTF2I, COG2, MY06, LMNA, TCF3, Cl4orf3. PEA1 5, PRKCBP1 , GALNT3, 
IRS 1, ACPI, GUK1, MBD2, PTD008. RBM4, TNFRSF10B, K1AA0266, NQOI, DNAJA1, FACL3, FERIL3, CD59, PPAP2A, 

45 FACL3, KIAA1598, TGM2, MTMR9, LOC51760, TST, TM9SF1, LGALS3BP, P24B, D2S448, RPL27, KDELR2. TJP1, 
OPTN, NME2, HRI, F12, RABIF, TJP2. ATPIBI, GGPSl , FU101 16, PTGES. SC02, PEN-2, PSMB3, CDS2, RAD23B, 
PPM 1 A, ARL3, TXNDC4, GOLGA5DDX32. DAG1 . VIL2. TPBG. GM2A, EIF2S2. NEUGRIN, DKFZP564G2022, 
KJAA0934, ADM, CSRP1 . GRIM 1 9, FAT, SLC21A1 1 , ACADVL, NDUFA2, GALNAC4S-6ST, EIF5, RAB1B, NMEI , ASPH, 
MUT, ARF4, FBXL1 1 , COPA. UBL5, CSNK1 E t ATP5I, CCND1 , HT021 . PPPI R7, LOC56851 . SRP54, DAF, CTBP2, TLE2, 

50 HSD17B1, SRJD5A1. SLC9A3R1, MUC16, PPL, MGC10765, EPB41L4B. SECTM 1 , CHPPR, SORD, VTMB. CRABP2, 

EFNAl, HERPUDI , CDYL, MRPS17, SGPL1. DUSP14. SSBP1, C20ort35. C3, HSPC163, ATP6V1G1, YF13H12, FU13052, 
ABCC10, STUB1 , NNMT, RAB20, CALU, PLCB1, NR2F2, HSPE1, TM4SF1 , RSN, FU20813, TPARL, SEPW1, H1F2, 
GRHPR, HSPA 1 A, RAB2L, SARS. FIBP, PSMB6, RER1 , BCL10, ATP9A, IDS, PPIB, RAB2. Cab45, PYCR1 , GSTM3, 
SEC24A, MAPT, FU10579, ADAM9, FU21603, DNAJB1 , C20orfl 16, DKFZP564G0222. RDGBB, RRAS2, AKAP9, 

55 K1AAI243, DC1. ELF3, PDE4A, CRIM1, COROIB, PXMP4. S100A13, DPP3, GTF2H2, PSMB8, TUBB4, MRPL33, STKJ9, 
VCAMI, MAOB, DKFZP566C134, CSNK1A1, FU20761, EGFL6, ATP1 Al, APH-1A, FU22055. TOPI, RCL, SMT3H2, 
POLR2K, LMNA, ID4, JTV1, CLN5, AKIP, TGFB1, LLGL2, ITGAVPPP2R5A, IFNGR1. JAG1 , DD96, PGRMC2. SNRPE, 
MGC19606, DJ971N18.2, CKAP1. MGC3180, HYOU1, PACE-1 , FU22662, KJAA0674, ALS2CR3, EPLIN, MYOIC, CD164, 
PCMTI , 1L1R1, SERPINB2. HSD17B4, FOLR1, HRIHFB2122, FU22457, MXIUTCFL1, POR1, FU20375, H4FD, KRT7, 

60 TFAP2B, MRPL15, SLC5A6, RGS16, TNFAIP I . FU 1 4 1 46, HOXB7, P1K4CB, RPS20, CI Iorf24, SYNGR2,NCKAP1, APG3, 
RHBDL2, ASC, Clorf9, KJAA0247, HRB, PAFAH1B3, SNK. ASB13, LSM1 , GPI, MCJ, CASK, HOXB7, RBBP6, PK1G, 
SMARCA4, BLVRB, HYPK, SUCLG2, K1AA0494, SLC2A10, HIG2, TSTA3, TNRC9, SEC23B, SELENBP1, RAB6C, 
VAPB, ZNF144, PCNP, SULT1 A3, NQ02, SMP1, FU30656, NUBP2, FU20152. ATP5H, FU22418, DCN, SOD2, FU20958, 
YWHAZ, TRPS1, CYP51, SUCLG2, CGI-45, ZFP103, MID2, CPD, TFAP2C, Clorf37, dJ222E13.1 , 1CMT, UNC84A, 

65 CALM I, DF, SUPTI6H, BZRP, SLC9A1, FUI31 10, ATIP1, MUC5B. CTSB, GJA1, SDHC, SUCLG2, MGC3067, PBEF, 
IL27w, HSD17B7, GRSFI, CD9, HI 1 , FU10099, NIT1, LAMC1, HBXIP, NDUFV2, STXI2, SDHA, D123, Z39IG, RPL5, 
PA200, SC4MOL, HSPCI 71, STXBP1. CACNG4, MAGED2. MGC4368, MPZL1 , ZDHHC7, RPA40. IGSF3, FU22638, 
SPTLC2, MTVR1, FU21016, SGK, NCOA1, MAP4, GLRX2, P4HA2, JAG1, MTVR1, FU22940, NDUFB8, 1SGF3G, 
B4GALT5, EMSI, C22orf2, LRPAPI, PON3, EIF5A, ENSADKFZP564F0522, FU1 1273, EPS8R1. EDF1, ISG20, EPS8R1, 

70 FU10525, PSMD4, NINJ1 , TSSC3, FDPS, RGS3, CREB3, UBE2DI , ProSAPiPl . CAST, WBSCR20A, MAPKAPK2, RPP38, 
YWHAB, A2M, RBXI , PDGFRA, EFS2, RAB9A, RAD23B, BAZ1A, BCL3, SNX4, CLMN, HRY, INHBB, NPD009, 
AHNAK, TNRC9, S100A11, MYOIC, LDLR, K1AA0102SCYE1 , LARP, GNA1 1, NDUFA7. CKAP1. KPNAI , NDUFS7, 
RDHI 1, RAP140, MTCH2, HPGD, ITGB4BP, CLDN7, CG1-147, GTF2IRDI, LRRFIP1, DAB2, DKFZp667G2l 10, LGALS8, 
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MARS, MGC14480, MGC3038, PLXNB2, ZFP36L1, DBI, AP2B1, PLS 1 , CYC1 , PP1F, COL3A 1 , PDHB, NSAP1, PFDN2, 
GAS2L1, DMBT1, FZD1 , GBA, DNCL2A, VCP, MYOIB, ANXA8, CI lorT13, DSS1, KIF1 3B, CECR5, GARS, COPB, 
NFE2L1, DLG3, FLJ12443, NALP2, APM2, KIAA0790, CIS, HN1, GALNT6, CLPP, STK24, PP3 111, MTAl, CAMTA2, 
BAT3, FADD, BC-2, CLOCK, UAP 1 , AAK1 , MGC3 1 2 1 , CD 14, CDC2L5, FYCO 1 , SQSTM I , UBE3B, CSPG2, EIF5, DEFB 1 , 
MTMR6, K1AA0643, 101 F6, SLC35A2, TNKS2, TPMT, WWP1, LHFPL2, NEDD8, PC326, PTK2, FU20748, FOXAl, JDE, 
FU20275, CACNB3, CDC42, TEX27, KIF3B, PP3501, CDCP1 , HNRPU, TULIP1, SPARC, DVL1 , GMDS, EZF-2, AP2S1, 
GNAU, SEMA4C, WT1 , KIAA0010, LAMA5, PTDSR, ETFB, KJAA0284. TFF3, GRHPR, RPL37A, Gl P2, MGC1 1242, 
FU23189, FKBP9, MGC35048, RTN1, ASL, PTK9, THBS2. SDHC, HIS I, DSTN, MGC3047. PAFAH1B1, AGPAT1, PGM3, 
AKR7A3, COL1 Al, KJAA0436, GDI1, CYR6I, RNPEP, SGPLI, APBA3, GNB2, SOCS5, FGFR3, RGS19IP1, ORC5L, 
SLC6A14, K1AA0229, FLJ22028, LAMP1 , SNRPD3, MAPK13, DNAJA3, FU22471, CKMTI, PSMB4, CCL22, CLU, 
CD163, ANXA3, ATOX1, GTF2E2, ANXA6, FU21 127, BMPR1 A, WBSCR20C, MBLL39, IL4R, SEC24D, SLC19A2, 
RNASEHIALAS1, ACAA1, DPM3, ABL1, TUBB4, EFNB2, CALR, ARPC1B, MCP, SH3GL1, ECT2, LOC51619, NEK1 I. 
MAFB, EFEMP1 , G10, DPP7, FUT2, ATP6V0E, SLC22A5, SSH-3, SYNE-2, PH-4, CTBP2, BATF, PDE4DIP, TR1P6, P2Y5, 
RNASE4, CANX, CD2AP, HIP1R, FH, ADCY2, SPUVE, FU10462, QSCN6, CLTA, SLC31 Al, DEPP, CLTB, K1AA0544, 
CTSB, MARS, PAK4, PHIP. HIP2, FU23375, ARHGAP8, TNFRSF 1 2A, KRT8, UBE2V1 , PDPK1 , KIAA025 1 , PPGB, GAS 1 , 
RAD17, PIAS3, 37872.00, ABCA1, FU10375, KJAA0217, SPR, GRN, E1F2B4, ITGB5, RPN1, APLP2, WDR1, SDC1, 
MGC2963, PM5, MGC5 1 78, TBCE, EEF 1 D, SGP28, FEM IB, FIJI 0829, FLRT2, KJ AA0934, PCDHGC3, COPS6, PARTI , 
ACACA, AMPH, LUM, FU23338, EPHB4, FBP1, WSB2, HBP1, EVA1, MUS81, POLR2K, K1AA0103. HPS1, LOC55831, 
FEM1C, RJN2, DKFZP564O092. ENDOFIN, DHCR24, FU20604, LOC9014I. PCDH7, SLC7A7, SLCI2A2, FU21047, 
S100A11P, CGI-1 15, TOM1 LI, Clorf34, SOX9, MB, EIF4EL3, S100A8, APLP2, TDP1, FGF13, URG4, RARRES3, 
FU1 2910, DAP, RFX5, MVP, FU21749, PAXIP1L, FU20152, ATF7IP, GPSN2, RIL. VEGF, TM4SF6, SPP1 , NVL, CALR, 
CKAP1 , AKAPi , HSPC166, TMPRSS3, TM9SFI , LOC56902, ENT3, GRB2, COG5, DOC1, COL5A2, RLN2, GRN. ADCY9, 
KJAA0690, ENPP2, ILF1, SLC35A3, SLC39A1, C20orfl 1, PCDHGA1, CGI-1 48, WBSCR20A, CSGIcA-T, KIAA0937, 
KJAA0674, LTBP1, H2BFT, SEMA3C, SULT1AI, ERP70, KIAA1078, KIAA0869. PLA2G12, PACE-l, K1AA0984, AUP1, 
RBSK, AMOTL2, SULT1 A3, LANCL2, PAlPt, JUP, PPP3CB, KYNU, SH120, PRKCI, ARG2, OSBPL2, APOL6, GATM, 
LOCI 13251, GM2A, FU12436, CD24, SYNGR3, HSPA1A, CTNND1, SEC61 Al, IFRD2, PCK2, PSMA3, COL6A1, 
ARHGEF5, RAI, VPS45A, BECN1 , GNPI, PA200, PXF, BZWl . K1AA0876, KJAA0471 . ATP6V1D, CRYM, K.CNS3, FARP1 , 
ANK1, FU20234, PLU-1, NPC1, ZNF339, RNF14, RBPMS, SEC13L1, KJAA1630, SIX2, SGSH, RPA3, VLDLR, ENPP1, 
ITSNI, AP2BI, ARHC, SWAP2, UBE1, MARJC4, MK-STYX, HDLBP, ZNF185, K1AA0227, GOLGA3, K1AA0033. RAB26, 
SHANK2, ALDH3A2, DCN, HT008, PLAB, IMPDH1, GRIT, FARP1, MAPK13, ERBB2, TGOLN2, RALA, ARHE, ABCF2, 
PRSS1 1, PLCD1, HSPC1 11, TRIM29, ARL1, ACTN4, MUC1, DJ434014.5, FUU619, SH3GLB1, TCN1, FU1 1 149, BIK, 
ZNF91, PRSS8, CYB5R1, TRJM16, EPS15R, NARF, SLC1 1A2, AUTS2, LIM, SLC1A1, ALDH7AI, TC10, SC65, IRF7, 
HLXB9, RAB17, KIAA0746, PCDHGC3, APOC1, AKAPI, EPS8R1,TBCC, DDAH2, TYROBP, N33. FLNB, 
DKFZp564A176, PRE13, JAG2, UGCG, OSR2, KRT6B, CDC42EP4, TPD52, C20orfl49, FUJI 2975, MAN2A1, GCN5LI, 
MCF2L, FU22386, STHM, RAB26, API SI, GMPPB, CYP2B6, F-LAN-1, PRKCZ, DC-TM4F2, KIAA0556. FU12619. 
CD163, DAZAP1, T1MM13, MADH2, COL4A5, POGK, FXC1, POP4, NET1, ARHGEF5, NS. KMO, PTP4A2, LOC57228, 
MUC5B, AUH, BAIAP3, SFMBT, CD44, BYSL, FU20085, PARGl.C4.4A, PSMD4,GSK3B, PSMD12, EIF2AK3, SCARB1, 
DPI, STRN3, FU23263, CTSD, HSGP25L2G, TFIP1 1, MPZL1, SNAPC3, RBM3, PP591, TGFB 111, GRHPR, AHR, 
FU12389, SORT1, KDELR3, ATP6V0C, MD-1, D8S2298E, XAP135, HSPC023, C9orf7, C21orf97, DNCH1, ZNF36, 
PPP1 R7, VIL2, RAB2, MYH9, TRIM 14, UGTREL1 , CTSL, KJAA0977, RPC62, UBE2N. DCAMKLl , FUCA1 , ATP7B, 
RBSK, ST5, CGI-90, NOH61, FUI0925, RAB22A, RTN2, KIAA0089, SH3GLB 1 , CDS 1 , MGC5466, WFS1, AMMECR1, 
COX17, ACOX2, FU10I01, HT012, LMNA, PRDX2, SULF1, KIAA0923, FU22637, SCA1, PAIP1, CAP2, CMT2, ZNF217, 
CYB561, PAPSS2, STX18, FZD4, DDXx, UBAP1, ITPKC, PTS, PGLS. LAD1, DSC2, STOMLI, DDX16, PTP4A1, 
FU10901, SLC12A8, NME3, TEM7, NPR2L, ACY1, GNB1 , GRN, PLEK2, KRAS2, ARHGAP8, FU1 1856, DCN, 
LOC5587 1 , NAGK, FU14154, FU22709. TP53TGI. STK6, COX5B, MICA, EPPKI, EPS8R2, MMP19. WWPl, TUBG1, 
LBP, ATP 1 0B, CLN3, UBE2G1, SULF1, FU30002, SYN47, CSPG2, CACNB3, IGFBP3ELOVL1 , DTNA, ANK1 , CI2orf22, 
EPN3, IDE, DKFZp76IF2014, SEC22L1 , ILF2, ACTR1 A, FU10052, STAT3, CED-6, FUI0359, SOX9, PIASY, K1AA1 169, 
CAV1, HOXB2, FU22191, LOC571 17, PMVK, BLNK, TREM1, HSRTSBETA, EIF4EBP1, SIGIRR, TSLRP, C20orf44, 
PTD009, PP1665, HMG20B, RTCD1. PDE8A, CNNM2, GNAU, GPX2, KIAA0599, FU13868, DBN1, GEMIN6, PMM2, 
SPTANI, PFN1 , DCTN1 , UBE2A, GPR107, MRPS2, SNARK, SSA1, SHI 20, UBPH, CPD, HOXC6, DXS9928E, TEAD3, 
PGBD5, ST14, CNN3, KIAA0256, MGC3262, FU13840, CLDN4. FUI 1939, ABC A3, OAZIN, MRPL17, PPP2R4, CGI-135, 
KIAA0802, AP1M2, SCN10A, PPIB, MRPL40, ZK1, FIJI 2517, CDH1 1, CDC42EP2, CLN3.CGM52, FU10815.CI lorfl3, 
MADH1, FU20539, HMGE, KIAA0923, LAP1B. PTGDS, FU20559, SFXN1, KRTHB6, UNC13. MUC4, FUT8, NETI, 
NEBL. BCS1L, RAI 1 6, CAV2, FA AH, CEACAMl, LEF1 , GALNT10, NAGA, ABHD3, STOML2, Clorf27, OSTFI, 
K1AA0227, PCLO, MYO10, THBSl , LANO, HMCS, H3FK, SPS, C9orf9. PITPN, SCRIB, PAM, NPDC1, ASNS, SLC33A1, 
HSPA6, HMBS, FU21918, FUI 1939, C6orf29, PRSS15, ENC1, HTR4, SSH-3, RECK, NAV2, TRN-SR, MRS2L, FU20366, 
LOC51754, LGALS8, KIAA1040, B4GALT1, FU21841, KJAA0237, IL8RA, MLF1, ANXA9, VRP, LOXL2, MIR, ATP5D, 
KIAA0632, FU20174, FRAT2, DDX26, BCKDK, ATP6V0A4, KJF1B, ENTPD3, RAB1A, EGLN1, KIAA0268, LGMN, 
PTPRH, KMO, UGCGL1, AKR7A3, R1G-I, CYB5R2. FUI 1773, RPS6KA2. CLCN3, PTPN18. GNG12, PKP3, ALDHIA2, 
NEK3, UQCRC1, ZNF236, RASAL1, RPL14, FUI2287, AP1M2, C4BPA, MAF, FU10815, FU90798, TRAM, POLR2J. 
TLN2, DNASE2, PEX1 1 A, KIAA0790. TM4SF3. HPGD, TR1P10, THY1, CGI-143, TPR, AQR, CTNNDl , HOXC10, 
CDC42EP4, PLEC1, PSFL. PTP4AI. FU22353, NCALD, INPP5E, MKRN4, PADI2, SMARCA1, KIAA0317. EHD1, AZGPI, 
SMARCA1, NOVA1, MRPS1 1, FU23091 . HOXC4, OCRL, CKAP4, CD44, CD2BP2, FU10055. TM7SF1 , PVRL2, ID4, 
DJ434014.5, SLC7A8, DKFZP5641 1 22, MIPEP. PLA2G4C, KPNB2, DAXX, NOS1. ID3, MRC2. SSSCA1, PPP1R14B, 
MTHFS, HSPA5, ELF5. MARCKS, K1AA05I4, RRAS2, ADRMI, ANK1, KJAA1324, PSEN2, UBXD2,CALU, DOK5, 
KCNMA I , COL9A2, ATP2C1, FGFR2, DPM2, KIAA0895, DPH2L1, MUC5B, SSRl, LOCI 13146, KIAA0644, LOC5I042, 
DNAL4, PI03, GPS2, CX3CL1, INHBC, C1QB, PDPK1. RPLP2, HRJ, MOC4825, TGFBR3, LAMC2, PEX7, HFE, 
DJ434014.5, FU20296, MGC5347, FU 1 052 1 , RARA, KLC2, SLC21A2, SPTAN 1 , APOC 1 , LARGE, STK38, GCC1, SNX13, 
TNNT1 , NTRK3, TGIF, H3FH, KIAA0485, KIAAI416, EFEMP2, SMARCE1 , KREMEN2, UMPK, K1AA0268, DDEF2, 
VAMP3, CGTHBA, OSBPLI0, CGI-96. MGC3248, TUBB-5, PXMP3, RBM9, LOC51257, LAM CI , SLC30A5, PPARD, 
KIAA0349, MAP4K4, GNG4, CCL3, GPRC5C, CCR1 DKFZP586B0923, RNF10, SCGB1 D2, VIPR1 , RGL, TESK1 , AK3, 
KIAA0649, SCARB2, MGC2494, FU20048, EPS8, DNAJC1, MOB, FUI 1200, CD1C, AGP ATI, FBLN1, GW1 12, ICT1, 
CGI-141, DSCR1, PIP5K1C, PRY. ALP, PRDM4, PLAC1, ISG20. FU20457, TCF-3, PTE1 , TNKI , MAGED1 , FU13782, 
NPD009, UCHU, PRE LP, LOC55893, KJAA0451, AK1, LMCD1, NET-7, AP3B1, OS4, ABI-2. NOTCH3, KRT9, COPZ2, 
CGI-58, RISC, DKFZP566C243. ATP6VICI, TRIM38, PTOV1, PDGFB, PIP, IDN3, FUI0199, BCAT2, HOXA1 1, PDXK, 
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NEDD4L, MGC29816, TPD52, TMPRSS4. HAIK1, SUPT4H1, WNT3A, PCSK7, ANK1 , FCER1G, FU13397, EROIL, 
BPGM. HLA-DQA I . DCXR, KIAA1094. NEOl, FKBP4, SMARCD3, TPSG I , FU2 1 940, APBA2BP, TMPRSS6, TBC1D1, 
MS4A6A, U2AF1 RS2, MOC1 1 308, MRPL23, PCDHA12, SMA3, CELSR3, SLC22A4, MOEA6, ICA1 , STX4A, EFS2, 
RRP22, X123, GBA, DNAJB1, T0FB3, CRAT, FU1 1159, TMEM8, OALE, FU20333, DDX3, TULP3, TACC2. SLC6A8, 

5 C3AR1, BSCL2, TRIM2, ELF3. SPTBN5, SERPINB8, FU23259, TNFRSF6, M1PEP, CELSR2, LDB1 , MOG1, PXF, HPIP, 
HMOX2, SURB7, HRJHFB2I22, FU22056, CLASP2, IF, HSKM-B, UPKIB, WDRI0, 1QOAP1, PSPHU DUSP4, FU10856. 
RARRES1 , ALAD, PARVA, KIAA0608, DNPEP, GMPPA, FU20254, IDE, COL5A1, GFER, PSMA7, FUMOI 7, ZNFI44, 
MYC, PEX14, CCR5, ARU.NME5, NDUFB7, PPAP2B, C2lorf80, CAPG. MRPL52, MIG2, HSPC039, DPH2L2, SRD5A1, 
SDRI, RAB36, SCGB2 A I , PRDM4, ASM3A, FRA, GLUD1, FLJ13187, CARM1, RPS6KB2, LOC55565, B3GALT4, 

1 0 ALOX5AP, PLAU, DMN, DFNA5, CG1-36, TCIO, SLC38A6, KIAA0852, CRJP2, HSPC003, NSFL1C, FU20605, GPCI, 

FU10504, MKLN1. T1P-1, SCAM-1, IL13RA1, UPLC1, FU20171, LOC88523, HSD17B2, MYOIB, ZNF364,CDK7, MAP7, 
PCOLCE, IL13RA1, SSNAI, ESRRA, CPS!, APOE, MY0I4, CHK, THBS3. DAB2, PCMTI, MAP7, SLC7A4, APPD, ITCH. 
KJAA0255, BCMP1, AKAP9, SNCAIP, MRPS7, PIGPC1. HIVEPI. SLC6A8, DKFZP564O0823, CRK, BAIAP2, SLC7A1 1, 
CPE. MPZL1, TD02, FUT1, STAB2, CDKN2A, CGM2, TPM4, IL1 RN, MGC4504, KIAA1395, COQ7, CARHSP1 , PARVA, 

1 5 FLNC.CI lorf24, NPR2L, OFPTI, ARVCF, CAPN9, SRRM2, NBL1 , KIAA1078, SURF5, ARHGEF4, F23149 1, FKBP1 1, 
K1AA1 102, IGF1, RBT1, HNOEL-iso, LAMB2, DKFZp566O084, FGB, GPNMB, TLR5, CX3CR1, THBSI, GORASPl, 
HCA1I2, AQP3, BDKRB2, SLC4A7, CLTB, MRPS18A, CTSK, CELSR2, KIAA0652, NKX3-1, MXD4, ALDH4A1, DYSF, 
ECGFI, DCN. PSME3, TIMP2, HOXB6, EGFR-RS, EPS8R1, ECM1, LTBP2, PRPS1, CDA08, HUMAUANTIG, MGC955, 
FU22678, LAMA4, GLUL, MAGED2, HES2, FASN, CYB561, IDH3A, MPPE1, PRKAR1A, KDR, DICERI, PROCR, 

20 TNFSF10, HAGH, FBX03, TC10, PRKAR1A, ZNF20. AK1. ALDH3A2, FSTL3. ZNF408, PTP4A1, PMS2L9, BAG!, 

DKFZp667G2l 10, MUC2, KIAA0265, ZFP100, KCNK1, IFI35, THYI, FLI23186, H2BFG, ARSA. KRT15, 1CA1, FLNA, 
BPHL, PCTK1, TUBA2, KRTI7, SHANK2, CEACAM I GAK, VARS2, AGTR1, ASB8, MPZL1, RFPL3, DNMIL, SPUF, 
KIAA0792, NUCKS. CI R, HRASLS3, TM4SF6, SPINT1 , XT3, SLC16A5, FU2I079, MST1 , MMP9, DKFZP434B044, NY- 
REN-24, ALDH1 A3, NID2, KJAA0409, ANKRD5, KIAA0513, U2AF1RS2, 1GF2R, H2BFL, FUT3, LEC2. LY6E, CSH2, 

25 SRCAP, DKFZp434G23l I, CHST4, PPP2R1B, PVALB, FU12960, ITPR3, PODXL, PARD3, PRSS22, FUI0697, MGC2376, 
SLC39A4, MRPS16. QPRT, GFRA1, BRD2. CNGB3, LAK, C5orf8, PPP2R3A, HCGII-7, ANK1, OAZ3, PSMC4, ACATE2, 
DKFZP434L01 17, EDAR, PPFIA3, GRB7. MCM3AP, CALB2, APXL, ABI-2, TTR, CSNK1D, DJ1042K10.2, TR1M38, 
PSCD2, HSPC134, SREBF1 , HUS1 , PSK, C12orf5, SPOCK, EDG4, FUI0769, ANKRD3, FU21 135, PPP2R4, CED-6, 
GATA6, MGC10963. ZNF14, CPR2, KIAAI 199, H1P1R, NOL3, ZNF306, FU14298, RAGE, IDH3A, GPR107, KIAA0368, 

30 RPA40, MEIS2, PHLDA1, CELSR1, N33. BLZF1, FU22637, 1L1RL1, GOLGA1, SARI, FGFR2. FLU, ANK3, SIRT7, 

BAP29. EFEMP1 , FU20277. DXSI 283E. LAMB1 , TLE2, TJP1 , PDE8A, RCV1 , HYAL2, ERdj5, KJAA0350, CLSTN2, MDK, 
LOC51762, APOE, K1AA0964, SSH-3, TJP3, ZNF193, PRDX2, PTGDS, TEM7, DNAJB4, POLR2D, DKFZP586J1624/ 
JAM I , LHX3, FU 1 0252, KIAA045 1 , INE2, WIT- 1 , FU23209. CXCL1 , RAI2, KI AA0857, FU2 1 062, KJAA 1 096, ARF4L, 
THBSI, RAB31, SSt8, NDRG3, TGOLN2, FU 10665, COL6A3. TAZ, AGRN, PGC, SOX 1 1 , MCP, EXTL3. ACRVI, NELL2, 

35 MGC4309, LOCI 14990, KYNU. SNX1 1, ANGPTL2, CYP2J2, SMURF1, SDCCAG16, BRAF, NFYA, ADD1 , LIG3, CAV1, 
BIRC1 , TJP3, STEAP, NDUFA2, MYBPC3, CINP, KJAA1096, ACLY, TUBB, GREB1, MARK3, TEAD4, CGI I, UNG2, 
SLC30A5, FU20920, ACAA1, EIF3SI0, SEC5, SLC31A2. MGC10993. VEGF, P4HB, TFPI2, DKK1, ARPCI A, CHST1, 
MAF, FU90798, KIAA0682, GRP58, CACNA2D2. MAPKAP 1 , GPR27, ICAM 1 , RPL39L, C YP I B I , P1GO. KIF5B, 
HSD1 1B2, CLDN3, FU20255, SNX16, FKBP10, STK23, DRD2. SPA 17, FOLR1, WNT16, KJAA1010, FU1 1467, EFNA4, 

40 H3FB. RAB5C, EHDl, SLC7AI 1, RHOBTB3, COQ7, SLC21A1 1, FU14827, SPRR1A, PVR, MAST205, CFLAR, PAX6, 
N33, ADAM 10, GNAl 1, ZFP26, GPR48. KRT4, C2, CRIMI , MGC3121 , FU23471, GGCX, PPP4C, PAWR, PTHLH, „ 
K1AA12I9, SRP72, ETV6, ALOXI5B, SLC24A3, SLC25A4, RDS, DAXX, ICAM1, LOXL1, GMDS, TRAF4, NTHL1, " 
LISCH7, GAS2L1 , TRIM 10, SIAT4A, FU22584, SLI, 1TGB5, TFPT, CD8A, DSCR1 LI , KIAA0779, GPRC5B, PP591 . 
SEC3 1 B-l , PPFIBP2, CYP27B1 , DOC-I R, COP9, KIAA1 1 93, MST1 R, HBS1 L, RARG-1 , FZD7, KIAA0626, SMT3H1 , 

45 RALGDS, SOX13, FU226I2. NFE2L1, CST7, KCNJ5, PALMD, KIAA0644, MRPL9, ERCC1 . MSTP9, PTPN3, SUPV3LI , 
GAL3ST-4, SUHWI , PRSS16, C6orf9, PTPRT, CGI-1 12, TBX3, ARD1 . KDELR3, CGA, TSPY, SPAGl , CRELDI , FU20967, 
RNASE1, LRP3, LARP, SOX1 1, TULIP1, RORC, HARC, RPL5, FU13544, MAP3K12, K.IAA1096, PLA2G10, RAB2, 
FU12681, FU23469, PP1057, MAPT, TMEM4. PSME3, FU21963, SGCB, GLI3, PRRG2, MYL9, GFR, HOMER-3, 
PDGFRA, DPP4, D15Wsu75c, KPNA1, SGCD, RABGGTB, MMP24, FGL2. ATF6. STX10, ARHGEFI2, UPKIB, EGFR, 

50 MCAM, CYP3A43. FCGR3A, FU10534, FU12571, FU20422, CD80, KIAA1023,C2lorfl8. H4FH, TEL2, MSCP, PEX10, 
B4GALT2, ADAMTS5, CSGlcA-T, TNFAIP6, PRKCDBP, TRIPll , PTN, FGD1 , NPEPPS, CAPN 1 , H2BFH, LOC5 1 337, 
FLJ21736, VAV3, FU1 1198, KIAA0923, NONO, ALDOB, AQP6, FU20315, PHLDA1, VDR, K1R3DL7, YBX2, DUSP3, 
MGC1 1271, CHST6, MGC4171, PL6, SH3BGR, SPPL2B, EPHA2, CRYAB, MST1, RGS16, CLPTM1, MD-2, K1AA0152, 
PACE4, DKFZp564K142, RALGPS1A, DKFZP564A022, RTN1, LAMB3, PLD1, SERPINB5, ENSA, DKFZP586N0721 

55 PLAA, FKBP14, LRIG1, RARA, BN51T, PTHR2, PPP1R3C, HSPC002, CNTNAP2, HNF4A, CHI3L2, TGFB2, CG1-58, 

PPFIA1, K1AA0440, PLAUR, SNTB2, 1D1, ALOX5, IGF1, OPCML, TAGLN2, UBXD2, M 1 1 S 1 , REPS2, BCHE, SRD5A1, 
TED, EIF5, KIAA0595, BAIAP1, K1AA1718, TRA@, STS, CI lorf!7, ASNA1, MAOA, PTGER3, NPY1 R, SMARCA4, 
PGM3. PCTKI, MATN2, FU23393, MGC282 1 , MGC2376, FZD2, SLC7A6, PPAP2C. PHKA1, GOLGA1, WARS, 
GADD45G, LIV-1, NEK1, C22orf3, VAMP4, C18B1 1, MGP, KIAA0040, IGU3, FU21 125, BTD, G3BP, CLEC1, NUP98, 

60 MLN, NRXN3, FBXL7, DLGl, PLA2G5, CYP26A1, OR52AI, DSC3, PPAP2A, C20orfl 2 1 , UBE2H, EEFI A2, ATP I OA 

TFEB, GABRQ, GFPT2, WIG1, FBLN1, PTPRF, MEPE, RAMP3, COL13A1, SLC6A8, PPP1R10, COL18A1, GACI , EPHX1 . 
CI lorfP, OSF-2, ETS1, INS1G1, FUI01 1 1, CEACAM7, DCX, CI4orf58, MIRO-2, SRPX, EPHA1, CRK, CPE, TIMM17A, 
LGN7, CENTG2, FU10534, C6orfl8, FU12671, VEGF, SPANXA1, MECP2, EPHB3, TSTA3, ILVBL, F7, BAZIB, MGEA5, 
E4F1, PPP1R13B, PZP, KJAA0913, CSRP2, DKFZP564K2062, CA2, SLC7A8, BNC, ADAMTS1, PIASY, MGCl 1061, 

65 FER1L4, FKSG28, ZAP128, FU21610, ATRN, NEU1 , H2AFA, IL10RA. BNIP3. NRPI , WISP3. C8orf4, TGFA, FU1 1526, 
MRPL2, HP, DHPS, SLC7A8, GPX5, PLXN3, CDC34, POLR3K, FU1 1506, KIAA0980. PDCD8, EVI5, CST3, KIAA0752, 
C1orfl6, CYP4F3, ROR1, MAP3K9, HSPC121CDKN2A, CAPN9, DUSP8, APOD, CCRK, DDX26, USP21, PP35, ABCA1, 
IGHG3, IL1RL1, ELOVL1, HPIP, FUI2650, KIAA1078, IL17R, H2AFN, FU13352, ELK1, TPM1, TLN2, PPIC, SLC16A3, 
FZD3, CARS, TNFSF8, zizimin 1 , GALGT, DSCR6, TP53TGI, SPTAN1, FBXL2, H2AFX, HMGE, TCEB3, PLN, FU10847, 

70 SNAI2, STC2, MACFI, ARFl, UGT1 A9, PCDH7, MAN1C1. NESG1, EVFN1, FKBPL, KIAA04I7, VDR, SPUF, SCGN, 

1GSF4, ARKS, F5, LIMK2, POP3, RGS5, LOX, ADORA2A, PEXI4, VAX2, RANGAP1, MSF, TNFAIP1, C6.1 A, ARHGEF7, 
LPIN1, KIAA0876, ZFX, FU22635, PLIN, TRIM2, EDG2, POF1B, IF2, PPPIR9A, ANG, STC1 , DNAJB2, ODAG, 
KIAA0763, FUI 1274, FU20151. MARCKS, ECGP, MFNG, COG7. K1AA0429, NEDD4L, ATP6IP2, DONSON, MUC6, 
PTGES, SO ATI , MAN1B1, TNFRSF9. SEC61 A2, KIAA0500, AP3S2. K1AAI089, B4GALT4, PTGER3, TLR2, FCGBP. 

75 ZDHHC3, KIAA0716, MMP12, CYP2A6, GRAF, LOC54499, NNMT, COL8A2, OXTR. NOL3, ZNF79, HRASLS, HAMP, 
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AIFl, CGI-38, SPUF, BAZ2A. FLRT3, PDEF, PDK3, SLC4A7, HMOX1, IFNA2I, HKE4, CA5B, KLK8, PLUNC, NCBP2, 
KJAA0703, Tl A-2, MSX2. FU20374, ANXA2P3, DLG3, PON2, 1L17BR, AGRN, PRDM1 1, TNFRSF6B, STXBP2, PTGDS, 
MARCO, UBE2G2, EPB41L1, PDGFA, IL13RA2, CXCL6, CGI-96. APOAI, MRF-I, NPAS2, MRPL41, LENG4, FGF1, 
TRAM, AM BP, GPLDI, CH13L1. AQP1, SSB1, K1AA1608, MEIS3, FU13385, IHRAPL2, NQOl, MINK, KJAA0843, 
5 DKFZp564A176, MOP3, BGN, B1G1. FU131 10, dJ222E13.I , SWAP70, DKFZP586L1 5! , TBC1 D2, MAGEA3, ARF3, 

CSNK1 Al, KRTHA6, FU21034, GPR58, K1AA1735, MGAT4A. GNA1 1, SLC4A2, H41, HAP1, CYBB, MARK], GRJT, 
ETFDH, FUS1, PTN, FUT2, CDSN, MAP3K6, CHST8, BENE, ATF5, ENPP2, PEX13, PAK4, CUBN, SLC39A2, MY06, 
DRIL1, SELT, SLC25A22, HFE, KIAA0237, PKD1 , NPAS2, ZNF3, FU23516, S1X2, L1MR. STAM2, NEIL1, VIL2, MATN3, 
FU23537, AADAC, MCAM, GPR65. TP53TG1 , CAP350, CYP17, EMS1, DKFZp5470146, TNS, MGCI3523, ASTN2, 

10 TRA1, NPY, CEBPD, PNLIPRP1, PNMT, TM7SF2, NCF2. AP4M1, ITGB4, SLCl 1A1. LIM, CBFA2T1, FU20184, RAI14, 
WBSCR20B, BAIAP2, COPS7A, PNMA2. KIAA0923, PACE4, FIJI 0261, K1AA1395, EDNI, ADAM DEC 1, LTBR. 
K1AA0509, R1L, LPPCALD1, MCRSl, HML2, FU22965, FU21870. ME1, FLJ22405, RJTI. FUI 1565, K1AA0481, 
FU20627, XLKD1, RAB5C, AMPD1, PDCD4, BMPR1A, SLC26A6, KIAA0939, FU10874. KCNK15, ARHGEF9, HDLBP, 
MCF2L, AQP1 , FUI3055, PVRL3. RNPEPL1, GPC4, ADCY9, PTPN13, MGC2656, TSNAXIPl, AC02. IRX5. IF2, CIC. 

1 5 KIAA0976, BDH, ZFPM2, PSEN2, C20orf46, NDUFS8, GGA2, FU 10490, TPD52L1, HLALS, ALFY. FU20699. UEV3, 
AES, DKFZp76IK1423, JAG2FU13195, DDX8, G0S2, ITPK1, SEMA6B, SLC16A3, CCL18, HUMPPA. EIF4GI, HRHl, 
GSA7, FASTK, HBP17, FU141 17, LOC146542, APPBP2, TNRC15, CLDNI 1, SCARA3, H2BFJ, APEG1, PPP5C, TDRD1 , 
IRS3L, IGF1. PDGFRL, MUC13, DUSP10, KPNA6, FU22795, OASL, HRMT1L3. MOS, SCGBIA1, PEXI 1A, ARHD, 
KIAA0977, MMP24, FCN1, ACPI, LAMP3, AKAP6, ALDH3BI, TNXB, NF1, APOAI, RBP4, CLTB, GP2, FBX02, DRG2, 

20 DLG3, PCDHB3, FOLR2, NCBP1, SOX 13, HOXD4, FGR, EFEMP2, KJAA0625, TULP2, GPRK5, EVIN1, CHODL, CDH8, 
FU22173, OR10J1, 1FNGRI, PRO 1787, ACADSB, LAMP1, HSPB7, PCSK2, KRT6A, C5R1, DUSP5, MGCU36, TPSDl, 
HMGCS2, BCAR3, MOCS2, KIAA1233, VSNL1, UBD, ANGPTL2, GENX-34I4, FU12547, HMGCS1 , KDELR1, CPT1A, 
VAMP2, GSTZI, GJB3, MRPS12, PCBD, FU23322, PASK, ARGBP2, SEL1L, FST, FARP2, HSF2BP, CGI-96, MGC2601, 
PBX2, FZD1 , ABAT, TSHB, KIAA0874, RHEB2, FMOl , NCDN, CSPG2, KIAA0844, FU2253 1 , COL4A3BP, ACE2, NAV3, 

25 SULT2B1 , TETRAN, RODH-4, MADH1P, HT009, ACR, CLECSF12, SULT1 B1 , ELM03, NICE-i . HSA243396, NDRG2, 
GSTT1 , BLAME, TAPBP-R, SERPINA 1 , CNNM4, TCF3, SSX5, MPDUl, CHP, FU! 1 183, NOL6. FLJ23129, FUI 1 196, 
DKFZP761I2123, KNSL3, DTNA, BDKRB1, CSNK2A1, 1D4, OCLN, CLCN2, SLIT3, MAPK7, EZF-2. GYG2, K6HF, 
ALS2CR3. TMEM2, NPAS2, HOXB9, MAN1BI, APOBEC2, HFSE-I . DNAJC7, POU5FI . PSMB1 , PAFAH2, FU13852, 
CCK, PITX1, NTE, ABL2, CLN8, K1AA0819.GALNT10, FU13841, NEFL, ARHGAP12, APOC2, PTPRO, HSPA6, NMB, 

30 OR2F1, MPP2, HPGD, CALB1, ADRBK2, AMBP, PPP1R1A, CCR7, C20orf28, TRA@, EFNA3, CX3CL1 , F25965, CD2BP2, 
CDC42EP1, OLFMI » C20orf31, SNAPC3, MIRO-2CALB1, PIK4CB. FOXA2, CI 10RF4, RRAS. HUNK, TGFB2, RBMS2, 
MASPI , ATP6V1C1, NMU, PCDHGAI. SLC29A2, PPIE, GGA2. FU20535, POU5F1, MGC5509, CITED I. ATP6V0E, LIPE, 
ACTN1, SLC26A10, SLC21A9, WNT4, RBMS2, MRPSI5, P8, KIAA1609. FBXL1 1, TGM2, CHRNA1, TSSC4, SBBI31, 
KIAA0356, OLFMI, SEMACAP3, CD6, ITGA2, GTF2H1, FA1M2, FU21313, STATS B, TBX2, GABRD, AVIL, 

35 MGC26I5FJX1.FU14675, IL1RL2, AK3, ZNFN1A3, SSPN, RELN, SIGLEC7, COL5A2, HLA-DOB, SLC12A3, HFE, 

PLINP-1, Apg4B, MGC39851, HIPK2, HSPC159. PSK-1, ABCA12, MMP15, PKP3, HERC3, RECQL4, DKFZp434C0923, 
UNC84A, FTS, AZGP1, FASTK, ARFGEF2, DSCAM, MED8, SPP2, P2RY6, RPIP8, DHPS, ST14, SAMHD1. MGC32043, 
SPARCL1. FU22160, GHR, YAP1. MTMR3, SLC20A2, PARTI, PTPN14, BAIAP3, EPPB9, EDI, TPM4, TEK, PR01942, 
H2BFE, LEPR, NAPG, MGC29761, SLC34A2, ZNF358, GRB14, CMKLR1 , KIR-023GB, MET, PBX1, CYP2D6, SLC7A8, 

40 IL13RAI, ARNT2, GTF2H4, CD86, BM88, CEACAM1, BIRC1, CAMTA1, PDZKI, MOCS1, GLYAT, ChGn, RQCD1, CRA, 
BAIAP2, PTX3, CYR61 , VAMP4, HSPA4. HUGl, GBL, EPS8R3, PTPRU, DLGAP1, GEMIN7, MADH6, PTPRG, NFX1, 
KIAA1028, RNASE6, AD037, PI 15, SN All, LOCI 57542, ACTG2, SLC35A3, SIRT3, NPR2, NPC1L1 , HOC, DDR2, SLC5A2, 
OASIS, FU2I5H, LRP2, RGSI0, ALDH8A1, COL4A3, GS3955, CLECSF6, UP, MKL1, MADH6, PRDM5, WNTI, SPAG4, 
SORBS1, ASPH, PLK, IGSF1, ARHF. CAPN2. LIG3, SULF1, CCKBR, TEAD4, C8A, MGC 10771, FCGR2A, SEC14L1, 

45 KLKl I, SPIN2, C8orfl7, THBD,-FKSG28, NEURJU FU10647, LTB4R, CHRM4, C3orf4, ALLC, SLC3A1, SLCl At, 

MS4A4A, EDNRA, ILT1 1, IGHMBP2, MGC4276, IGF2R, FLJ20421, PBX2, 37872.00, FU23604, FOXI1, LUC7L, CD86, 
PVR, SCD, GPR37, UNCI 19, NXPH4, FCGR2B, S100A2, MORF, BMPR2, AKTl, FIJI 1715, IL13, TADA3L, NFATC4, 
PPP3CC, CARM1, PTGIS, PLOD, CD36, BBOX1, VNN3, AKR1B10, S EM A 6 A, E2IG4, HOXC13, RNASE4. 
DKFZp434H2215, EKI1, MGC5356, KIAA0752, RUNX2. ACCN2, GALNS, CABYR, PCDHA3, SSX2, GOTINPAT, 

50 COR02A, DGCRI3, CAPN5, GPM6A, GLRB, NPEPPS. RIPKI , CYP-M, GLRA3, B1GM103, UTX, NY-REN -45, ATP1 A3, 
ANXA2P1, IL1RAP. PRO 1 600, WNT2, HYALI, SH2D1A, TREM2, TUB, KIAAI036, KCNBI, CNN1, BLAME, PITXI, 
DXS542, ADORA1, TNXB, GABRE, FABP3. PGRMC1, FU20513, SCIN, FU13052,CP, LIMKI, MSF, EDN2, FU20623, 
ESRRG, KIAA1237, INADL, K1AA0889, HS3ST3AI, FU22593, ASIC4, FU21 144, FUI 1827, TAT, FU20584, SMA5, 
NCOA3, GLP1R, PRODH, FABP3, FDXR, DEFA4, SORBS1 , MRPSI2, HSFI, EEF1 El, CTLA4, WDR4, ASB7, ABCA8, 

55 CLPS, PSMA7, ARHN, PEGI0, AKAP12, MGC12904, FU10312, FUI 1539, RAD1, SERPINF1, MGAM, PVT1, PTHLH, 
STS. PRG4, SYNCOILIN, CASP2, FU12168, MARCKS, HTR3B, RECQL, COL4A2, CD97, TR1M36, MGAT3. GRIN1 , 
SOX4, KIAA0475, DKFZP586M 1 1 20, SLC2A4RG, CTSZ, SQV7L, PLD3, OR7E24P, CDK5, GRIA2, PRLR. MHC2TA, 
CST6, LOC56920, NUP214, BET1L, FIGF, THBS4, HLA-DRB4, CAPN6, TLR7, MBTPS1, KIAA0992. BG1, FU12681, 
MAK, APOH. TNFAIP6, CRYAA. PKD2, IGFBP2, TSPAN-3. ATP6V0E, KIAA1579, MGC20727, KIAA1093, LOC55565, 

60 HS322B1A, LOC51285, STCI, KIAA0992, CGI-01, TRGC2, EPHB4, DES, CNOT4, MAP4, CDC42EP2, HSD3BI, RDH5, 
XYLT2, CHRD, SPBPBP, PDP, MYBL1, HPN, GOLGA2, LOC63929, EXO70, PCDHB1 1, KIAA1036, ANGPTL4. 
TNFRSF10C, EVPL, TEADI, SIAH2. PMMI, DPYSL3, FU 14297, TACSTD2, BSN, FAP, SEMA3A, RERI, AXL, PROL4, 
CASKIN2, RENT1, CLDN3, DRAP1, ADAMTS7, TCEB2, EPB41L1, GUCAI A, FU22659, PAPPA, CBLN1, FRCP1, IL1F9, 
ITCH, MMP26, STRN3. CEBPD, COL21 Al, BTD, KIAA1034, MIG2, FU2059I. FGG, ASCL1, CXCL14, PDE1 A, OR7C1, 

65 HLCS, PTPN21, HUMMLC2B, SECP43, BCATl, DRD2, TAT, MSR1, OMD, IGFBP4, C13orfl, FU21919, FUI1807, 
AMELX, KIAA0346, FU21916, OLIG2, LI CAM, TAPBP-R, Cab45, NR1H2, TCP 10, KRTHB5, PCDHA9, TNC, 
DKFZp434L0850, FUI 101 1, SKD3, SPINK4, DZIP1, FU23548, FU23420, TFEB, PCDHA6, LOCI60313, FU10496, 
R29124J, THPO, AQP6, K1R3DL2, MGC10848, C21orfl8, ACCN2, TBLIX. RAB6B, BHMT2. APOB. IGSF4, PAPSS2. 
RBP1 , TCF2. R30953 J t CD3G, ZXDA, TNFRSFIOC, FU2I665, CYSLTR2, IL6ST, ZNF214, AICDA, PTAFR, FU128G6, 

70 BA526D8.4, CYP2C9, TWIST. PPP2R5C, MASP2, DUSP9, CGEF2, GABRB1, CDC42BPB. TNFRSF5, CCR4, PYY, 

PILR(ALPHA), BIRC7, LANQiWN, HlAPI, PUC1I, O0OI, TAEi PDCD1, lilM , PK**%> PHUNJ, RlMt, JI4DM, 
WNT2B, NRBP, CDH6, G6PD. Clorf22, LSM4, STX6, ZIC4. FPRL1. CALCB, AGPAT3, SHB, TOM 1 , AG A, ZICI, SIAT9, 
PTPRZ1, MSC, DKFZP566F0546, FU32069, CD28, PPP2R3A, ASTN2, ARHGEF1 1, JPH3, FU2I477, GH1, H0XD3, 
MS4A2, SVIL, DPYS, F2RL1, ECCF1, PRCC, P0LD4, OAZIN, CHRNA3, KIAA1000, DKFZP586D2223, DA24, WNT7B, 

75 MUC4. GCNT3, ORI El , CLSP, CDt D f CCRl, ORCTL3, EEA1 , SIX3, FU10140. FU10884, HNRNPG THSD3B2, 
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SERPINEl, RHO, MUC4, PTN, DNCL12, TNFRSF10B, LOC90326, NR6A1.NCYM, SCGB1D1, EPHB1.NOX4. DJI2208.2, 
PLAUR, PDE4C, PIP5K1 A, MGC14799, IGFBP1 . IDUA, IGHM, NAPA, PARD3, LIM2. ADD2, HSF4, CABP5, TF, TNXB, 
NET-5, ITGA3, IGFBP3, GDF10, PRB4. KCNF1, ATP1 1A, KIR2DL2. SMARCB1, MBP, IGL@, NFATC1, CDH16, RH06, 
CCL20, FU20605, ASIP, LDB2, HCRTR2, HOXD3, GPR87, VCX-8rLOCl 16150, TPM3, LRP1B, MAGEA6, FU20701, 
5 PAX3, IGSF6, TOMM22, GALNT3, CHMU COL6A1 , FAAH, B7, RANBPl, KIAA0876, CYP2A13, CD5L, C2Iorf2, RYBP, 
GJA10, COL15A1, TEX 13 A, SCNNIB, TRD@, RJL, ITGB8, PLEKHAl, GRJN2A, FSHB, PDK2, SAST, PRPFI8, FU 13479, 
GRP. SLC4A8, SMURFl , GK2, 1NSL4, FU2031 1, GLRA3, KIAA0828, DLX2, EPOR, RRBP1 . SDC2. ziziminl , CCND1 , 
P2RY2, CD28, B4GALT4, ARHGDIG, TBL3, IL17, FU20519, FAT2, UPK1 A, SERPINA2, CD209L, NRP1 , ACINUS, 
RREB1, TNFRSF4, PR022I4. DKFZp761O01l3MAP3K7. SPRR2B, DNAI1, NOVAI, DEPP, LOC51725, SCAMP-4, TLR4, 

1 0 MAX, PRDMI6, KRTHA5, PCDHB1, GNAL, P37NB, ISLl, SH2D3A, TFPI2, CREBBP, ACTAI, ALP, OR1 A2, CGI-58, 

SH3BP2, APAFI, CD209, DKK4, IL18RAP, ESM1, PAX2, EV15, MFNG, ATF5,CUGBP1, FUl 0376, CMKLRI , SLC23A1, 
MGC34772, FU23033, 1GU3, AMACR, S1N3B, CCL18, CSPG4, FU20241, DNM1, FHR-4, GNS, GDF1 1, PAL, PPFIA2, 
CASPIO, ORM2, SPTAN1, SPUF, CALCRL, USH1C, ALK, FUl 1850, FOXD1. SH3BGRL3, MNDA, EPB41L4A, MMP16, 
ANK1, WISP2.GSTA1, FER1L3, MGC33190, DAZ2, CHST3, DRF1, TM4SF9, CDC25C, ACVR1B, LU, SGCE, POP2, 

1 5 PCLO, COL18A1, TSHR, Eu-HMTasel, MSR1, GPD2, CLDN17, K1AA1069. CYLC1, ABCBI 1, MIG2, LY6H, ARFRP1 , 

BMP2, ACOX1 , FZRI , CAMK2B, HUMCYT2A, LILRB5, ENPP3, IL4, SCN1 1 A, CALU, IGKC, THEA, OPRL1 , KJAA 1 053, 
SIX1CABIN1 , SCN7A, THOP1. NR2C2, FU23462TRPM 1 , RAB3D, CREBL1, ABCD2, VDU1. GAL, CPN2, FUI0408, 
PHLDA1. RABIA, HAND1 , MGC5347, BAI2, EDG8, GPR30, PCDHB8, TYR03, PRO0618, PRKC1, UCP3, GSG1, 
PRO1048, HRH3, SARDH, FU10803, WISP1, PRLR, RIPX, NNAT, SFN, APBB2, TLL1, PCNX, KYNU, MKRN3, 

20 HGC6.1.1, PLN, RIPX, CDC2L5, ATP 1 1 A, SPI1» RIGPDK3, AFAP, KJAA0427, CYP4F1 2, EFNA5, FU1 1 125, DUOX1 . 
FU21240, DNAJC9, RQCD1, DLG5, PIGO, ABCB8KCNA5, KIAA0409, FU12891, SHMT1, DNALI1, POLYDOM, 
PFKFB4, SHOX2, DGKE, ELF2, MUC5B, WHN, SCAND2, LOC160313, FU23510, AK5, FUl 1871, ITGB5, CPS1, DBT, 
CDH17, FCGR2B, PCK1 , PLXNA2, ACE2,CD7, FUl 1619, ZDHHC1 1, FU21562, FU2021 1, MGC2821, FU20624, ICK, 
PARK2, PNAS-4, CLECSF6, PCDH1 1XFGFR3, PTGER3, PROX1, HRC, EPB41L2, KIAA1 1 17, ATSV. LAMC2, 1TGB1, 

25 TRA@, PAK2, DKFZp762Cl 86, OCM, HNF4A, AVPR2, FTCD. TNNI3, HR, SLC35A2, PPI665, GA, RGS5, OPLAH, GDF1 , 
OR3A2, FOX03A, TNRC21 ABO, ITSNl, PVR, CNGA1, UPK3B, PCDHBI2, ALCAM, HFE, KCNJ15, KIAA0997, RGS1 1, 
NDUFB7, ADAM28, FU13055, PR02176, CACNB4, RIN3, SLC5A7, FOXH1, PKDREJ, FU10232. DGKA, rctSDR4, EDG2, 
SEMA3E, SARCOSIN, THPO, PTPN21, POU2F3, MAPI A, ZFP37, SUPT6H, ADAMTS6, ASMT, DKFZp434C0328, ROR1 , 
FU22800, VAMP1, KIAA1654, RBM8A, EPAG, TNIP3, INSM1, XRCC4, IL6ST, UNC84A, UBE4B, CAPN1 1, NPEPH . 

30 TAS2R10, FU23093. NPPC, PTPN21, SLC22A8GAD2, LOC51063, OGN, MAGEA8, GUCY2C, NT5E, SGCG, CSorfl, 

LGALS2, PRKAR1B, DEDD, PPARG, PDGFB, PRO0461, ALFY, TNFRSFl 1A, DNAJC9, KCND2, PEG10, SPINK1,GCM1, 
VHL, CLDN1, PRSS7, H4F2. D21S2056E, CXCR6, LIFR, KIAA0599, TNXB. EHD1, ARNTL2, CGR1 1, SOCS1, PKLR, 
ZFP318, ZF, CHRNA1, DKFZp434M0331, DES, TMOD3, SP140. KSR, BS69, IREB2, PAWR, CACNA2D1, C21orf62, Gene 
Symbol, OAZl , CFL1 , RPL28, JAM 1 , CGM 1 9, NICE-3. RNP24, JTBFU1 2806, ARHA, FU1 3352, SYNE- 1 , TRPS I CGI- 1 1 9, 

35 NDUFB9C20orfl 14, JAM1 , RALA, FU30532, PIGR, MRPS24, MY05B, LOCI 55465, STUB1 , MGC 1 4353, ARF1 , C20orQ4, 
EGRI, ANAPC1 1, MRPS15, MIR, PIGPC1, MRPS21, CL25084, H41 LOC124220, RAB10, B4GALT1, PPP1CB, MGST1, 
TCEB2, MGC19825, HSPC163BACE2, BR13BP, FU1451 1, MRPL47, NMES1, FU14735, DAD I , KIAA1324, EN AH 
PSMB2, RHPN2, HTPAP, DKFZp761P0423, C20orfl08, MGC45416, TMEM9, UBQLNISTK35. APOAl BP, GRLF1, . 
SPEC1INSR, LOC150678, SMP1, FU321 15STUB1. HLA-C, ORF1-FL49, TAF10, RAB40C, DPP3, A1BZIP, LOC55971, 

40 SSR3, ATP6V0E, SNX6, SNAPAP, ALS2CR9, KPNB2, EPC1 , NTN4, C20orf52, H2AFJ, UGCG, 1MAGE3451454, EEF2K, 
MRPL14, E2IG5, MRPL36, GPCR1, E2IG5, MGC14I51, RASD1 , CGI- 141, AGR2, K1AA1437, HSPC210, BTBD6, H2AFJ, 
MGC14151, FU20048PSMB4, MGST1. FU31364, EGLN1, MRPL53, LOC88745, IRX3NFKBIEUNC5H2, TAF13, RDH-E2 
MGC 12966, DKFZp434GI71, GUKI, FU20671, FU20623, CAPNS1, PFN1, KIAA1671. FGG, HI 9. C20orfI49, CAPZA1, 
RAB18, FU23153,CGI-19ABCF1,TCEA3,NDUFB10,NDUFBI0, RNF7MAL2, NUCKS, RPL23A, LOC51290, TMEPAI, 

45 APH2, FUI3593ATP6V0B, TLP19, SLC17A5, ENPP5, C20orf24, AKIP, D1S155E, FU20171, MGC39329, MRPL41, 
NDUFV3, KIAAI096, LRG, BPNT 1 , LOC5 1 255, CISHPGKl , PLEKHA 1 , HSPCA. COPZ1, DKFZP434L1435, TMEPAI 
BRI3, AKIP, KIAA1 191, LOC92840, CLDN12, FU14525, C20orfl49, CDC42, TMPRSS3, LOC199692, FU22I74, 
LOCI 13246PKIB, RAP2B, HIBADH, LOC57038, FU 141 17, EDG3, MBC3205MGC2550. RCP, NUDT5, LOC51260 SIPL 
KIAA1223, HINT2, HNI , ERdj5, PHP14, MRPS36MRPL32, C6orf49, CAPN13, MIR, RNF19, ATP1 1 A, LOC51 128, FLVCR 

50 ADCY4, KIF5B, ARV1, RAB5EP, PX19, RREB1, MIR16, LOC51248SMAP-5, SYTL2, FUl 1320, MSTP028 OCLN 

MGC1 4833, SMBPRDH1 3, MGC40107, KIAA1 165, SPPL2A, Cab45, MGC20781 , LOC51 241 , MGC1 1266. DKFZP566J2046, 
FUI4624, CKLFSF6, LOC147184, DKFZP566F084, FU20203, FUI0856, MGC1 1034IMUP. CAMK2D, MK-STYX, 
RAB3D. C20orfl42, DNAJB1 1, MGC23908. FU 1 0074SURF4MGC 1 1 102HSCARG, MGC14327. HYPK, HSPC121, 
TOB1SRA1, MGC14832, JAM1, MGC27385, PX19, FNTB, MIR, LOC56932, POSH, MPP5, MRPL52, MIG-6, LTB4DH, 

55 ZAK, FU22649, SCGB3A1 , MGC33974, FU2I016MGAT4B, KIAA1404RBMSI, DKFZp761H0421. ARHU. FUI2697 

CGI-149, SPUVE,T1NF2, RPL17, LOC54516, WTAP, MAGI-3SAMHDI, FUl 101 1, FU10052FLJ23751UCK1, LOCI 70394 
TP531NP1 , HOXD8, XPR1 , MGC10540, SORBS1 , BCC1PFLRT3, FU22558, FUl 1200, SAMHD1 , PIGR, FAM3B, CYP4Xl' 
NFIA, KIAA1715, FU20160, CTHRCl. DKFZp547A023HSPCI21, LOC84661, LOCI 13386SH120, GNPNAT1, FU32499 
UBXD1, LOC90120, HBLD1, MGC13186, SPEC1, MYBBP1 A, MGC4248, DKFZP434I1735, LOC127018, FU37318 

60 FU20421. PTGFRN, p25, PIGM, MGC43399, ERdj5, SYT13, IHPK2TH1 L, FU20727, POLE4ASHI, KIAA1 130, LOC55829 
MGC10084, ZPR9. K1AA1458, CNN3, WASLFU20097, SURF4, HSPC163YAP1 , H4FH, MGC40214K1AA1200, C20orfl39 
PKIB, CGI-36, CLMN, SET7, SEC10L1. MGC22825, FU10525. LOCI 1 3386SELENBP 1 , SLMAP, VPS29, KIAA1972 
MTCH2, NPD007, OLD35DNCL11 MGC 14839, SHI 20 UBPH, APOAl BPLANPL, UBQLN1, FUl 1 101, C8orfl3, 
DKFZp434 A24 1 7, CI 4orf3 1 , C 1 4orfl 00, MMP24, CR1M 1 , FU23393, MGC457 1 4, INADL, SE1 1 , OPN3 , CG1-97, MGC2 1 874 

65 Cl4orf47. KJF3B, FUl 1046, C(27>3BETA-HSD, RAB18, IR1899308, MGC17299KIAAI223. K1AA1322, RAB23FU32205* 
DKFZp434K1 14, EHF, ShmiL, K1AA1434, K1F1B, EROIL, MGC 15397, BATS, C20orf45. FU31235. LOXL4, FU20707 
Cab45RNF7, MGC2803, FU36445. CLDN1 , DKFZp761 N0624FU20308, MGC33338, MYOSBRBM8A, MGC10765 
Cl4orf9, FU32642, ATP1B1. MGC4309, K1AA1272, LOC154467KIAAI483, UBE2H, EHD4, UBE2J2, FU20085 ' 
DKFZp762H185, MGC20486, MGC26847, MGC15854, LOCI 15265, NEK6. SPRR2AMGC 13045, MGC4604, LOC5I256 

70 ANKRD9FU31 208TR1M47. AP IGl DNAJC1 DKFZP434I1 16, LNX. SDCBP2MacGAP, FU14957. C20orfl 10, SURF4 * 
RAB5EPC12orf4, GL004, DC-TM4F2, SAT, DKFZP434A0225, GK003, dJ55C23.6, JUB, LOC89894, LOC1 15294 
C20orfl29, PCDHAI0, HSPC242RAB18, COX 1 5, MGC1 1115, MRPL27, MGC 15397. FU11 752, LOCI 16238, C9orf25 
LOC51760, MGC45408TBX3, HSZFP36. TRIM8MGC22793, BAL, FU25I57, C20orfl55, RPL35A, ZNF265ILF2 
MGC23166, FBX06, K1AA1870. DKFZp761D0614. ZNF398. ALS2CR9, MGC26818, EMS I, FU901 19, OALNT4, 

75 LOC54516, BRI3, HSCARG, PPPIR1B, GPR54, FUI4299, PPP2R2A, MGC5391 , SDCCAG28, PHP14, TGFBR3, MGCl 842, 



36 



3NSDOCID: <WO 2005005601 A2 I > 



WO 2005/005601 



PCT/US2004/018266 



MLLT4, DFFA, SELM, MAPKAPI, MGC10974, AD-003, FU10902, MEF-2, MURR1 , MGC2541 , GSR, MGC19825, MAFB, 
LOC139231, FU2309ITEM8, RERGKIAA 1 553, CFL2. CEBPG, KJAA 1 554, SEMA4BPDCD4, PNAS-131. MGC31963. 
HT002, HRD1, MESDC2, PRO2605, PTGFRN, KIAA1244, MGC10999, MGC107I5. CG1-85, KIAA0779, NUCKS, 
FU13881, LOC127829, HR, K1AA1538. KJAA12S5, STUB1, K1AA1841, CALM2, RJG-I, HOXB8N4WBP5, HTPAP, 

5 CXCL16NAC1, TRAB1D, LOCI 35 154, TKIM56, MK-STYX, Eu-HMTasel FU 30794, D1RC2PTPN23, GBP2, TRIM 1 1 , 

KJAA1976, MRPS26, TMEM9, FU23420, MGC148I7, MK-STYX, IDS, EPI64, K1AA1724, MGC2477, FAD104MGC32065, 
MRAS, DKFZP761L0424MGC4840, FU20739.GFRA1, FU23867, MGC40555, FU 14251, FU38628, MGC2941, 
MGC22805, NOL6, MESDCl, FU22865, FU25357, DLG5 ARHGEF5, HYPK, DHRSX, PCDHB2, FU90165, 
C17orf26PVRL2, DKFZP564D166, NOR1, GLIS2, SPPL3, TTC8. FU14502CED-6, MGC14141, MLZE, LOC57168, 

0 KIAA1337, KIAA0217, CRB3, KIAA1350, PPMl AFU20273CCU8, POP, MGC14859, GJB2, GPR, ECGF1LOC92399, 
HOXB9, LOC90522, KIAA1951, MANBAL, MGC1 1386, RIPKl , NLNHCC8LOCI 1 5548NUP88, TMEM8, CHDH, 
FU20507FGFR1FU30803, KIAA1280, FU13089, LOC120224, ZNF75A, DNAJC5, SDOS, MRPS15. MGC2628, FU1 1236, 
TRJM39, NESHBPFLJ 10839SULF2., FUI0210, METL, FU12707, HUMAGCGB, FU13195, FU21016, BOK, FBX025, 
OSBPL5, DKFZP434N 1511, KI A A 1813, VANGL2, LOC124446, HDCMAI8P, C20orf7, MGC1314, MS4A6AANLN, 

5 MGC40499, K1AA1337,FU101 16.NOTCH2, RRP40, PFKFB4FU 1 468 1 , K I AA 1 02 6, C 1 orf6, MGC5384, LOC85865, PHAX, 
MGC 1 1 1 34, FEM 1 A, LACTB, TIM50L, ARNT, MS4A6A, PP1L1 , C20orf3, MRPS1 5PGGT1 B, CXADR, LBP-32. FU22004, 
FU32069, UACA, MGC2747, FU13187Clorf28, CBX6, CIorfl3, NY-BR-t, FU20748, KJAA 1 82 1, FU3 1751, LSR68, 
TRAD, USP28, FU10702, GBA2, B7-H3, DKFZp547D065. TH1L, TSGA2, RORC, ETL1, FU30634, MGCI0702, 
TEX27MGC33602, MGC2555, LOC55893, LOC128439, EDIL3, KIAA0146, RFXANK.HS6ST1, NEK6, FU20186, 

0 MGC15416HSPC159, SCAMP2, LOCI 336 19, NGEF, CI4orf58LOC9l012, MGC 12972, MGC 1 1034, CYT19KJAA0819, 

LOC55893PHCA, KCNK6, CR1PT, CDW92MGC3195, GTARPAPOLG, MGC24180, KJAA1 126MTA3, MGC24103. moblak, 
MS4A6A, DAG 1 , KIAA 1 394, MGC13U4, KIAA1337, FU40021, DPP9, KIAA0789ZNF144, TMP1T, MGC13H4SYAP1 , 
FBX032, BOCCD44, LSM10, KIAA1673, CTL2C21orf63MGC2560, ZFP385, TM4SF9, DNAH5, PGGT1B, 
DKFZp586M1819, 1D4, CL1C6, C20orf64, YAP1, FU21615, GRP58, LOCI49267, C20orf7FU37933, FU90586, FU22626, 

!5 LOC51242, MGC4604, SDCCAG28, K1AA1321, TEAD2, RPS3A. LOC90701, FU32915, FU3I434, PUNC, TRPS1, 

MGC45441, LIN7B, DKFZP434H0820FLJ32468, DNALI1, COX4I2, HOXC9, FU20337CLMN, BCAA,OPN4, DGAT2, 
PRDM6, DKFZp761 J1523, K1AA1244, ICMT, FGF1 1. C21orf97, C20orfl69. VPS 18, SIRT2. MGC15677, MGC4604FHOD2, 
DKFZp547M072, CGI-125, NLN. MAP I LC3 AFLJ3 1 842, PGLYRP, FU32069, DKFZp547A023. MGC39325, RRP40, 
KIAA1880, LOC116254LOC51061SYTL2, K1AA0076, KIAA1580. GPT2MGC4840K1AA1345FU12577, Tcnr, CCT5, 

>0 FANCF, USP21, K1AA1273, DKFZP434F091, MGC13007MGC16131, SEC5FU22215, FBX022, MGC 1649 1, MGC 1602 8, 
MGC2601MGC15906, C20orf45C17ora8, IL17BR, STKUIP, SEC61A1, STAU2, FAPP2, FU25429, CAC-1, ROCK1, 
MS4A7, DKFZp434D0215, FU20442, HFELOC148523, LOC90353. HIPK2, ERBB2IP, CDKN2B, CGI-09, DPP7, DUSPI6, 
CGN, CLONE24922MSCP, DKFZp547E052, MGC45714, MGC5370, MAP4K1SLC4A1 1, MGC26568, PPIL2, MGC27034, 
FBXO30, DKFZp547C195, MIC2LI. DHRSXHTPAP, VIK, FU23841, DKFZP434D146, HPS3, 1PP, SEM A6ADN AJC5 , 

\ 5 ULBP2, LOC 1 20224, FU 1 1 036LOC90580, LOC92906, WDR5, RAB35FU 1 0697, M APT, FU 1 4825, KJAA 1 295, MGC22 1 7, 
ACTR8, SENP2, LMLN, LTB4DH, MGC1 1257, MGC15476, SART1 , TNNI3, LOC128I53, SCDPR01912, KIAA1896, 
LOC80298, FU20533,SMCR7CGl-691^U4977iaAAt691,C20om02,VIP,FBXW5,TRlM35.SLC30A5,JAGlSLCUA3, 
COQ4. OVCOV1, GLI4, RPC8, FU31 153, C20orfl62, NRP2ENAHARH2LOC5597I , FU20038C«CAM, UBE4B, 
LOC57 1 68ALS2CR9, SLC2 1 A 11 , GPR 1 08MRPL4 1 , KJAA083 1 KJAA 1970, DKFZp762I 1 37INPP4B, ZFP67HSPC 1 89, 

[0 PFlPCDHB6C2orf9KIAAl468, FU14399, DKFZp434GU8, KIAA1S00, FUU681K1AA0869FU22558APXL2, MGC16028, 
APMCF1, LOC90990. PCDH18, DKFZP564J0863, COGl UBE2H, KIAA1970, CTSB, MGC30052, FU90575, MMP28, 
MASS1, MGC1 3034, R1PK3, CCT4FU12519, GOLGA3RCPCP. MGC20983, FU35207 EML4, TRUB1 MRPL41ZNF213, 
RP42, FU20813, SAMHD1, KRTAP4-8,C4orflFBX08, EPB41L4B, ZNF75A, STK36, PAWRFUI, DKFZp761A052, 
C20orf23, AKIP, MGC4643, VTI1A, LOC223082, PDK4, PSMB7, KIAA1710, MGC13272, MGC4342, GNGI2, N33, 

l>5 FU14800, FU21924, LOC220074, FU22474, DKFZP434P 1 06, FU13236, PTENP1, FU21 159KJAA1441. CGI-85, FAM3D, 
DKFZP566J2046, LOCI 16441, TEAD1 LOC51249. PARVA, HSPC230. MGC5442FU23091LOC55893, PDCD61P, OGN, 
TRJM41, MGC42 1 05, CPNE2, DKFZp547J 1 44, KJAA 1 784, KJAA 1 337, SLC30A 1 RNAC, KJ A A0429NRXN3, Spir-2, GGCX, 
KIAA1694, DNAJA4, CAPN13, NAP1 L, RPS27LTMOD4KI AA 1 557, FU21415DKFZP564G092, CLN8PARVA, 
FU40021KJAA1708PC326, NOSTRIN, LOC129642, KIAA1 301, CGI-85, MGC13102, LZIC, KIAA2025, FAPP2, FU22679, 

10 SNX8, ZNT6. DUSP16, PANK2, FU14834, DKFZp434C0328, ROD1, FU34633, FU13391, ARHJ, FU1 1753, B29, OSAP, 
B2M, CYGB, DERMOl, MIR, WDR20, C20orfl55, FU329I9, MGC2408, CLGSCYLlDKFZp761A132, DKFZp451GI82, 
FU90119, FU3699I, SDCCAG43, PPPIR16A, MGC19764, FU13263, GNG2FU12517, MRPL20, MGC16212, SRAl, 
GEM1N7, FU37953. HBP1, KIAA 1737, CTL2, KIAA 1 754, FOXA1. MGC 13096, HDAC3BOC, FU30973, BRUNOL5SEL1L, 
SPTB, POU4F1, K1AA1337, MIZIP, NAGSCGI-72. PR01853TRAF4, MGC32124, SNCAIPDKFZp434O0515, SMOC2, 

>5 FU12770LOC1 13828, FU40432DKFZP434K0427, SFPQ. RNB6, BMF, GSH-2, REV1L, SLC27A4Clorn9, 

SLC5AIKIAA0478, SPPH1, FZD8. M GC26877 LOCI 503 79STK36, LIMD1, KIAA1694, FU25357ELAVL2, BM-002, 
ProstcinFU20374, STK35, FU31434, CHRMl, DLC1, FU36155, FU21939, MGC21675LOC51320, FCRH3, FU10948, 
MGC27034, MGC14801, MGC1 1 102, SEC14L2K1AAI393, DKFZP434A0225, DKFZp434F054. SHANK2, OSGEP, 
MGC45474, ARHGAP8. BTC1L I F7GRLFI , DKFZP434BJ72. MRPL35, PAPOLG. MGC33662, XP05CTEN, DSCR9 t 

)0 1TGB6FU 14768, STEAP2KIAA 1 522, FU32069, PCDHB3C20orfl 36, XRN2MARKI , DKFZp5470146, FU 1 25 17, 

FU 10597GK001 , CITED4. IGL@. GALNTl 3MGC26963, RASAL2FU20605, LOCI 1 2609, NLGN3. C7orf2, HSPC182, 
DTNASNX9, ALS2CR9KJ AA 1 2 1 9KIAA 1 1 90C 1 4orO 1 HSPC065, KI AA 1 22 J » FU 1 0252, C4orf7, K! A A 1 363. NCAG 1 , 
NAV1, Cl4orf28. KLP1, ZDHHC9, MGC2615, SMUG1, PHLDA1, AD-003, BRPF3, ASCL2MGC 15523, RELA, ROPN1, 
FZD4, ZDHHC4, KRTAP3-1, PCDHB16KJAA1036. SLC2A12MSTP043, FU32731 AMID, FU30277, CKLFSF2, TLR7, 

)5 SEMA6DNOPE, DKFZP434P01 11, SDS3, KSP37, PDCD6SNX14, A1BG, SLC3 1 A 1 , MK-STYX, SNTGILOC80298, 
FU25534, MGC10485, FU10035, NEUGRIN. BK65A6.2, NKD2, TJP2TRPS 1 FU20753, PPP1R1 A, LOC123169, 
LOC112817, ZNF341 , TM4SF9, FU90586Spir-lREN, FU10210,CEGF3, NOXA1, FU 14981 , R1MS1, PCDH20FU20360, 
DKFZp761H0421, MSX1 . DKFZp434F2322FU10l88. SPP2, MUM2SYT12, pknbeta, MGC1 1349, RNF40MGC4734, 
MAP1LC3A, FU13687, CNTN3, MGC 19604. TLR8, FBXW7, HM13, TLEIAKJP, SMURF2, FU21963, MRPL44. PRKAG3, 

^0 DREV1HSA243666, FENS-1LOC51693 FU10486, HAVCR2, HDAC3. AHRR, CXCL1 4, CGI- 

09MGC 13251 DJCFZp434E232 1, CI 4orfl 02K1AA 1 434, PHCAKIAA 1 1 45, FU00028, AMOTL I , TMPRSS6, ODF3, 
MGC4604, DJ667H12.2, VGL2FU10052FU13881, UK1 14, DSG2SLC1 2A4TBCD, MAP1B, OSBPL10GALNT10, 
DKFZp547l094MGC35352OSBPL6»TRIM7,FU30473,MGC2562. DLG1, DKFZp434P0531, K1AA1554ESDNKJAA1910, 
SEC15BKIAA1 172DSCR1L2. PSMB50SBP2, GPR34, MGC15854, HDAC5LOC90990, DKFZP564B1023, 

J 5 CASP2NUPl33Sptr.2, LOCl5l534,C22orf23, FU908U, DKFZp434II930,NET-2, LOC1 13026, HOOK3MOC8721 , BLVRA, 
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PLA2G12, DAPPI, FBG3 MGC10974, LOCI 14990, DKFZp547M20l0, FU20542, LOCI 4445 5CG 1-94 BRUN0L5HKE2, 
PRND, WFDC3FU30990, FU23654, KJAA0876, NDUFS1 WASL, KRT61RS. KIAAI684, RU2, DKFZP434K0427, 
DKFZp434B217, K1AA1549 DKFZp434F2322. MGC4126ENTPD2, GPRC5C, RGNEFFLJ3 1 052CEGF3S YN2, 
C! 1ORF30MGC3038, ITGAl IKIAA1053LOC57822, LOC130589, RASGRP4, DICFZp434H21 1 1, NFIA DKFZp434C0328, 
5 FU20209, NDUFS2SENP8SLC2A4RG, p25, C20orfl67K!AA1909. MGC4238, MGC16372. CDS, IGKC. KCNQ4, 

ZD52F10CCL28, FLJ20539KIAA1357, EPB41L4B, MGCI4128, SLC1 A5RHEB2, HSPC182, FU22527, MGC2I621, 
MGC5370KIAA1130. KIA A 1 554C9orfl 1 FU31937IMP-2C20orf5l , KRTAP17-1, DICFZP434E2318, 
DKFZP564BI 162RPL29, PROU89HSPA9B1CJAAI688, KJAA 1 324NCOA5, AXIN2*. LOC92689, KIAA1272FU14642, 
FU37440, FU 12604, RGS8, MS4 A6 AZNF2 1 6, LOC84570, KJAA 1 126, SMOC I , TSCOTMGC 1 8257, RDH13, 
10 C1QGKJAA1576, ZFP28GNA14, FU39155FLJ32069LOC155066. MGC19764FU10159, MGC16309LOC55862, 

PCDHB 1 437867.00, LOC56851, SNRK, MGC13017, ADAMTS 1 6AGMATPCDHB 1 0, LOC1I3179, NOL6, C20orf55, 
PALMD, GFER, BNIP-S, KJAA 1 337AXIN2, MGC39807, LIP8K1AA1053, MGC45378FU1 1273, 
FU23l29DKFZp586I1420KJAAI210COX7B2,TCF7L2, USP21, DKFZp5640l278, 

FAAHDPCR1NUMBMGC35285JUBEVXI, LM04AMOTLI, C2ori7TMPRSS3, ARHGEF7CSRP2BP SBB131, SSBP4, 

1 5 FU23654, CPMDKFZp762K222, DPP9CA5BKlAA1817CI4orf92, MY03A, VIK, CACNG4, NYD-SP21 LUC7L, SFRS12, 
LIPHDIS3, GCCI . FU10504, CASP14, KIAA1387, DAB21P, K!AA2028C20orf40OPR92FU32658FU25027, UQCRC1, 
EVC. COG1FU25555MOV10 ALDRJ-6, HTGN29MGC 12466, IBA2. MGC15548ADD3, GSN, 
C14orf50MGC22805MGC39650, KIAA1203FU14311, HRMT1L1, MASS1, CGN, IGHG3, ESPN, ZDHHC12, 
PCDHB4THRSP, FGFR2, LOC91461 FU25604DRAPC1, ARL8BACH1 , KIAA1921,GPR81 , KJAAl 145ARHGEF7, retSDR3, 

20 C20orflSARFGAPINSElTPSGIMRPL4, KIAA1870, X102, KIAA0599, CACNG6. FU22301, ZIC2, KIAA0599, MGC4796 
HT036, DQX1, SYTL4ICAP-1 A, KJAA0350. K1AA0872, GMPPB, FU37953. LMLN, NOL6, POLR2J2Hcs4, LOC57038, 
TRPM6,ABCC13, BCAR1FU30803FU32069KIAA1909, TIMM8BEML4MGC15606MGC35048, NRP2 PC A3, IL17BR, 
DKFZp727A071, MGC14128, GABRB3, MRP63, PGBD2GATA5, FLM4735. ENTPD6, SYNE-2PRIC285, MGC2555, 
LOC90378GLC ATS, GCN5L 1 , DKFZp434F2322. MSCPFU3068 1 . ZNFN I A4PRO097 1 TTTY6C 1 4orf47CTXLFTCD, 

25 MGC2835MGC 12435. STYXFU12076C20orfl06TEXl 1MGC19825, TPM2HOXD10. K1AA1554, FU20014, FU20748, 
PPP1R14C, ARHV, ALDOAEGFR-RSC20orf92FU14594MSCP, PRO0038SLC25A15, RSP3, PPP1R9A. 
EPHA7MGC3552 1 GFAP, ICEBERGFOXP3, FU33516GPR55, ZNF398, PROI635FU33903FU32203, ORMDL3, 
LOC51315, FU32752ELP2LIMD1KlAA1357DOCKl,FU14721,STClALAS2,HMT-lPADn,PTPN23FU102l0, FU10826, 
ELAVL3, LOC90668 FU32069. NOL6, LGALS1LOC55971, FU20273, SSB1FAD104, GPR107TRA@, SORCS2, 

30 LOC91010FGFRL1, UQCR, SEC14L2, D ENRST6GalN Ac I , KISEGLN1. ZNF219SNAP29. TNKS2QP-CSLC4A1 1, PURB, 
KJAAl 163, FOXP1, C12orf22. TCF7L2, CDH23. FLJ13955KIAA1828, FLJ33008LOCI 15704, SLC13A3ASB1, 
DKFZp762I194, CPNE4, GR1N3A, MSTP043, B HLHB5 A DM PRB M 6, MGC13275, KIAAI889, KRTAP3-3LOXL2, 
LOC51290, CI lorf23FU20309, MGC26778NAV1 , ARHUFU23749, FU33071NUMBL, PTPNS1 L2MGC3040SMAP-5, 
MGC2835CDGAPCHFR, FU90440. DKFZp434G0522FU 10300, TR1P1 1, HSFY, HOOK3, GTF3A, FU12634, 

35 NEOl TEAD2PTPN2, BCL2L1, KIAA1557 KPNB2. ACPP, CISH, DKFZP434P106, ASPH, DOTlL,FU22944SRGAP!, 
OLFM2, SIN3A, ASB12, CECR7MGC40397NFKBIA, POLRMT, CGM49C21orffl4, MTMR9, GATA4, XYLT1 , 
PCDHB7SEC15L, C20orfl60 MGC33302Clorfl9, COL12A1. EGLN3. FU21032MGC3040, ODZ2, ING5, 
CI2oraHS6ST2AQPl, MGC10981MGC33607FLJ14399PRACDCAL1, MGC40222, TMOD3, TEFSDS-RS1, 
LOC115098KIAA1573MLL3, FU14103AK3 ARPM1, CARD14MGC12916, ALS2CR12, FLN29, FU12697TOB2, 

40 N33GTF2I, BHLHB3GPC6, CAMK2D, KRTAP4-13, BDP1 , DKFZp761 H079, DKFZP564J047CED-6, EB-1 , MGC4659 

GPR1 10, DOCK1 , FLJ2021 1 , SCN1 1 A, LOCI 18471 , LOC151568 ZFHX2 SLA/LP, PCANAP7, HDAC3, POU5FI , GGTL3, 
C7. FU25410, SCAND2, C20orfl36, FU21616, EB-1, FU25067, KIF9KIAA1276, LOC55864, FU32771, DKFZp667B1218, 
DNAJC9LOC51319, FU10902. FU36525, MESDC2DDX1 2MGC33993 KIAA1399, LLT1, DKFZP434F091, 
FU1 2697GPR24, SE70-2, NANSFU12571, IL-1 7RC, TRJM7, NXPHl , ROR2, C20orfl50, K.LHL5, ZNF265, BECN1 SCARA3, 

45 PRO1580, MGC35392DKFZP434N178, PEX5R, FU31528, LOC135763CLECSF9, MGC41906, FBXOl 1ZNFNIA4, 
SPINOFBX022, IHPK3 C20orfl 67M AP2F[U25270, STRBP, MUC1 3KIAA 1 878, SNX9MGC26143 K1AA1887, 
KJAAl 71 2ASB4, BRUNOL4PDE11 A, ARG99. FU30162, ATP6VIG3, MGC10702. 

ARSDKCNJ2CAMK2DMGC12335KIAA1617HNTEB-1, GRP58, C2lorf59, KIAA1720, LOC221468CCL27CGI- 
62 MGC 10204, TNKSIBP1RRP40, FRABINDLX6APOA1FU30532, FU23403C7orf2 DNER, PDEUA, MAFMGC 14276, 
50 DLLl,LOCI46542,SH3GLB2KJAAl952LOC93109ENDOGLYXlMGC10724,lUn,CGI-105,CI4orf44, 
PAX6ASAH2MGC 12435, PGA5. and AGPAT3. 



Table 6 

55 Down Regulated In UPTG Verses UPNTG 

CD24, HSPD1, E1F3S6, T1MM17A, DENR, PAI-RBP1, KIAA0101, H2AFZ, SLC38A1, HNRPHl, RPS1 1, DEK, ZNF13I, 
HSA9761. MGC3077, CD24, CCT6A, RNPC2, ANKT, CSE1L, RABGGTB, HSA976I, SIP, HMGB2, SEMA3F, H1NT1, 
HMGB1, SERPI, RPL27A, FH, DUSP4, SET, K1AA0179, HMGN3, TOP2B, OAT, NUDT4, PCNA, BMI1, SIP, SDCCAGl, 

60 PBP, MAC30, SFRS5, ATP1B3, EIF4E, CRABP2, LRPPRC, DKCl , MRP63, STK6, CARD10, MRPSl 8B, TCF3, TCF3, 
MGC2747, FU20422, IF2, NCL, EIF5, TFAP2B, TIMM9, PPPICC, ZWINT, HSD17B1 , ATP50, CBX3, CRFG, PXMP4, 
UBA2, RNASE3L, USP7, LANPL, PTTG1, RANBP7, YES1, CDC2, RBM15.GMPS, PSMD1, TCF3, HSP105B, EMS1, 
NONO, TOMM20-PEND1NG, LDHB, DKFZP586L0724, DDX27, JMJ, CENPF, LRPPRC, ID4, EIF1 A, PSMC6, ID2, 
SECl 3L, TYMS, LUC7A. SNRPA 1 , RRM 1 . RARG-1, SM AP, FEN 1 , TCNI , ZNFI 46, ABCEI , DC8, MTCH2, FU201 52, 

65 CCNB1, CICS2, FU23445. TDG. DNMT1, MAC30, RPA40, GMNN, APOBEC3B, STMN1, EIF1 A, MTHFD1, MGC5560, 
USP1, ZRF1, EIF5A, WDR3, FU20530, RPS21, BAZIA, MCM6, M!CB,OPAl, LAMA5, ECT2, RAD21, RNASEH1, 
FU13081, STXBP3, PAI-RBP1, OSR2, FU20006, K1AA0I86, Cl9orf2. NUP107, TAF2, GCSH, FLNB, ZNF363. SEMA4C, 
RAE1, GSS, NEK2, GTSE1 , PAI-RBP1 , ABCEI , FU20986, MAD2L1, VEGF, LZLP, K1AA1025, KJAA0092, ANP32B. 
SRRM1 , NXT2, TOPBP1 , FU20485, SFRS7, SMC4LI » CPSF6, LIN7C, FARSL, NDUFB6. FU 1 2888, LANPL, ENDOFIN, 

70 KR18, FUI 1029, DLG7, WDR12, DC12.CDC5L, SLC35A3, PIGF, PRKRJR, MTOI, CASP6, FUI 1 149, FU22637, LDHB, 
PPID,GTPBG3, HMMR, SLC31AI, POLE2, KIAA0984, DJ434014.5. RAB6K1FL, ASE-1, HNRPA1, FU23468, CALR, 
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MELK, SLC25A13, TFDP1 , RES4-25, DCI3, CGI-1 11. ARH, FLJ14547, TSN, CYP2B6, PDX1 , LCE, FANCG, DHFR, 
KIAA0020, QDPR, MTIF2, HLXB9, SART3, JAG2, CKAP2, PRCI, SNRPDI, LOC51 184, RAN, OLD, PREI3, SRRM2. 
RAD1, CCNB2, FU13657, K1AA1 1 16, RACGAPl, FU13576, DKFZp564B0769, RFC3, KJAA1630, CCT6A, TIP120A, 
RUVBL2, FU23277, DDX 1 8, PMSCL 1 , LEPROTL1 . SCGB 1D2, TIMM13, C4orn , KRTHB6, DD5, C 1 D, PNN , ORC6L, 
5 K1AA0I70. ASK, DLEU1. SFRS3, SLCI9A1, HIP2, PPP2R1B, BIRC5, EPS 1 5, MGC13138, HNRPD, STK6, HSPA8, 

METAP1.KIAA0776, HSPCI2S, K1AA0419. MAGOH, CHORDCl. APPBP1,UBL3, RAD51, LOC5587I, GLRA2, CUL4A, 
ARHGAP8, KIAA0648, COXI7, SUDD, RAP 1 GDS 1 , FLJ 1 4639, BCL9, EZH2, TRIP13, FU1 1210, TOM M 70 A, PTP4A1, 
AMD1 , DUT, KPNA2, CYP3A4, RFC4, OPA1 , RNF6, IBTK, LBR, MGC13138, KIAA0097, KJAA0532, OIP2, VRP, HDAC9, 
KLC2, FU20700, AD24, ALMSl, FU21901 , DKFZp547P234, FLJ10656, TOP2A, MYC, TAF4, POLR2E, K1AA0528, 
1 0 CRY1 , MST4, ETFA, HOXC6. MTX2, HMGCR, RPC5, TOPK, DKFZP5641052, CENTA1 , FU20758. KCNMA1, KNSL6, 
CG1-30, MRS2L, PAICS, ZNF85, DJ434014.5, RABGGTB, HEYI, KIAA0485, KTN1, KJAAI0I2, CDC20, 
DKFZP434L0718.CEPT1.MYNN, FU10637, ANXA9, RNPS1, RBBP4, SSH-3, LOC90355, CAMLG, KPNB2, FU23259, 
VRK1 , FBXOS, HSP70-4, DNAJC9, MYCBP, Si 64, NTRK3, TAF9, SPG4, DKFZp667G21 10.CDKN3, INHBC, PEX1 1 A, 
CDC27,HMGB3,THOCl,FU12151,DKFZp564B0769, HSU79266, DMN, C10orf3, THOC2, NDUFA6, GCSH, PPAT, 
1 5 RHAG, SMC2LI, SE70-2, KPNB2, LSM6, FU10377, IL1 RN, K1AA0547, FU14007, SCLY, KIAA0379, UBE3A, 

HTATSFl, LOC51685, AGL, BET1, FU13782, UMPK, SMARCE1, LSM5, CENPF, EEF1EI, TPT, FU10719, IF2, CGM2, 
UCHL5, FU20628, ERN2, BLM, FLJ21940, PDCD2, STRJN, UMPS, MRPS30, APBA2BP, TCEB1, CREB1, MGC9084, 
NOLAl. BUB IB, MGC10471, RFC5. RRP4, FU13187, CCT5, HSA6591, CHAF1 A, FACL3, IMPAI, FU23558, CDC25A, 
CDC5L, BTN2A1, FU20422, ELF2, DKFZp586F1019, FU22624, LOC51659, CRFG, WHSC2, HNlL,OAZ3,CDl A.CLPX, 
20 CABC1, CLASP2, HSPA9B, KJAA0007, SLC1A3, NPM3, SUSP1, SLC16A5, M6A, UBE2 J 1, TBC1 D4, C20orfl , TBXA2R, 
UVRAG, MLH3, FU20331, PEG10, PRPF4B, KJAA0332, MPZL1, KPNB1, FU10204, TFAM, FLJ20281, FU10604, LAT1- 
3TM, KIF2, RBM12, MK167, HRB2, KIAA0056, ZAP3, COX1 1, SNRPDI, AMD1, TRN-SR, FU20641, RB1CC1, KIF4A, 
FU20093, TPR, RAD50, PPP1 Rl 2A, HNRPD, PIR51 , PSPH, TTC4, HIC2, SLC39A4, RLF, KNSL7, NOL3, ZNF-U69274, 
EIF4ENIF 1 , PDCD4, CTSC, CYP2C9, KIAA0677, BCLl 1 A, LOC56906, TIA1 , SYN2, RNAC, RDX, FOXM 1 , HRASLS3, 
25 STAG2, HMMR, KJAA0376, CAPN10, CHEK1, NICE-4, MRPL19.TSN, DKFZP434MIS4, PPID, NEK4, SMC5, MGC1223, 
SUV39H1 , ESPL1 , RANBP2, FU2301 8, SNAPC4, LGN, HYA22, JAG2, KIAA0644. NPR3, FOP, PKMYT1 , APPBP2, 
HSPC1 35, C20orf20, EIF4E, ZNF239, FU20909, CNTNAP2, ZNF292, LIPT1, FZD7, KJAA0971, SSH-3, MRE1 1 A. 
KIAA0090, PAWR, SMC2L1, CGI-1 12, SOX13, HBB, KIAA1 193, CAP350. RRS1, MTCPl, HBAl , GRPR, LCT, RAD51C, 
PRKDC, SPAG5, POLQ, BRCA1, GNAI3, FUI4346, ZNF24, CENPA, E2F3, DDX 18, SFRS2, PSP1, FU14827, BFAR, 
30 FANCC, DMXL1, CUL3, C6orfl5, BCLG, SIL, LOCI 33619, MGC2306. K1AA1096, GMEB2, ASCL1, EBP. FZD3, PRDM2, 
KNSL1 , FJX1, PPP1R3D, SRP72, DKFZP564D0462, CCNF, PA1-RBP1, PR01496, RBBP6, TEB4, SP192. DCTN4, 
B4GALT2, SRF, ZNF200, DNCLI1, SCYE1, PPI5PIV, FU22087, SLC29A1, FU12439, VDR, TIMELESS, TAF15, CGA, 
FU21816, SHMT2, SRISNF2L, DKFZP547E21 10, OIP5, MGC2603, FIJI 1896, C18B1 1, IGU3, PPARBP, DCX, TAF5, 
MGC5306, LIM, PTER, PP1L2, FU10998, NSEP1 . KIAA0332, MCM4, DLAT, KIAA0453. RPL23AP7, TTF1 , WRN, TTK, 
35 MARK3, SF3B3, FU20552, TIMM8A, PANK3, LIN7C, FU20225, FUI0287, MFN2, FU21908, REV3L, MGC5566, ZNF42, 
MSH5, HCAP-G, FU20591 . SPHK1, E2FI, FLJ14054. CCNE2. MGC4701 , Clorf33, BITE. MCM5, KCNK15, AGTPBP1, 
FU20274.CLPTM1, LANCL1, FU20125, FU11785, BARD1, MYOC, RB1CCI, FU23151 , RFC1, SLC25A12, FU10330, 
TMPO. KIAA01 57, STC2, UBCE7IP5, MGC5306, COL13AI , TMSNB, PTTG3, FU40452, MADH6, IF2, SRP72, FU2O0O3. 
USP2, YY1, FU23053. KIAA0276, TIA1 . P RDM 10, OXTR, HRASLS, BAZ1 B, M96, SLC7A5, CYP26AI , PBI , TCBAP0758, 
40 TLE3, POLD3, LIV-I, HNRPL, FUI0407, CHAT, UPF3B, RAMP3, TIMM17A, G3BP, PCDH7, FU90754, MCLC, EPHB3, 
STXBP6, CSTF2T, GYG2, PRKCBP1, RRN3, FBXL2, MDM1 , PNN, SMPD2, TTF2, TFR2, GDAP2, FU10989, MATR3, 
PROl 598. PAF53, OGT, HNRPH3, H326, VDR, KIAA0843, UTX, K1AA1 172. RYBP, FU20005, SCML2. SF3B1 , KLHL3, 
NOLC1. ING1L, KJAA1467, ROBOl, TGIF2. C8orf4, NUDE1, PDCD4, FU1 1004, AKR1CI. DKCl.COCH, FU20666. 
HSPC121, FU10261, PMFBP1, RADI, SLC4A4, FGFR2, SMARCC1, BAZIA, CGI-I30. NESG1, FU13909, GRM6, 
45 FU13942, SOX12, FDX1, LGN, GRIN1 , BTN2A1 , NCBP2, NMU, CDC6, OAZ. CDC7L1, CNNM4, NOL3, FU1O038, 

KJR2DS1 , KPNB3, SLC4A4, FLJ22390. SLC6A13, NY-REN-24, KIAA0923, LOCH 325 1. SIP, ERCC6. DICFZP586A0522, 
RAB1 IB. ZNF197, WHIP, KJAA0040, KJF5C, GTF2H3, PAPA-1, HNRPH3, NDST1, C9orfl2, KIAA1069, MAC30, 
PPP2R1B, ZNF363, KIAA093 1 , NFRKB, MGC12760, HSU79274, SELP, RAB33B. MYH1 1. TIAL1 , MCMI0. 
DKFZP434F1735, KJAA0553, SAFB. FU12455. DRIM, CFLAR, KIAA0542, HTRIB, SMC4LI, TIMP3, MLLT2. 
50 ARHGAP1, KJAA0255. WASF1, POP1, KIAA0286. PASK, DACH, SF3B3. CDC2, RCL, IL2RA, IRX5, DUT, FUI2684, 
FU20640, NSPC1, ABCGl, T, ZNF174. PPP4R2, SPAG4, FU21596, ZNF11B, FLJ13449, HBAl, E2F3, CDC2, BICD1, 
RAP2A, CSTF2, LSM8, DYRK1A, FU21940, H2AFP, DATF1, ANGPT1, C20orf46, FU20147, ZAP,CASP2, K1F14. 
DDX17,TRIAD3, TAX IBP I, PEX7, KIAA0182, TIMM44. CIAOl, FU 13490, MED6, FBLN1, SMN1, OR10H3, 
ARP3BETA, DLAT, TXNRD2, RC3, HUMGT198A, MTHFS, CAT56, CRSP6, DCLREIA, ACRV1 , TAF1, PPAT, SEMA4G, 
CXCL9. CUL2, AGRN. ZFP100, KIR2DS3, RECQL4, PTPN13, LOC93081, IRF4, IGL@, CYP2B7. CLASP1, MCF2L, KLK5, 
COPS7B, B3GALT3, DKCI, YES1, CHPPR, MGC2I654, TROAP, FLJ2331 1, MKL1, K1AA0650, MRPS34, SMARCC1, 
PEX 1 1 A, ZNF2 1 2, GABRR2, NUP98, SIGLEC7, ZFD25, RRM 1 , TF1P1 1 , M96, AD024, API S2, TIMM1 7A, GM2 A, TAS2R1 , 
RARG, GDI2, FARS1, ROB04, RINZF, FOXF2. CASP10. CITED1, RPGRJP1 , PHTFl, 37870.00, PP35, MGC4659. 
K1AA0092. EPHB1, KCNJ10. HOXD4, NUP160. PTPRD, PRODH, PTBP2. PFKFB2, SGK2, ACADSB, BRIX. EML4, 
EDNRA. CHRNB3, NUP155, KIAA0648. SEMACAP3, LOC57406, AND-1, CRSP2, STAG1, SH3BP2, NR6A 1 . MGC2827. 
HNRPH3, SIP1, FU21986, CTH, PDEF, HABP2, RPGR, COQ7, TTTY2, FUI 1767, LOC81691. HSPC1 U, MGC39851. 
TAP2. NUF1P1. GABRA4. CDH2, SMTN, ZNF305, C8orfl, ULBPI, VAMPI. FU20477. LHX6, CD6, NSBP1, KLF3, 
SLC13A3, LOC55862, LCK. CDC25C, CGI-32, DKFZP434D193, MBD4, GNB3, BAIAP3. FARS1. CHRNB1, GCAT, 
KIAA0342, STK.18, MPHOSPH10, CRMPl. UNC84A, CACNB1. KIAA1053, KIAA0953, SERPINA5, FU20433, SIGLEC6, 
DKFZp762E1312, LAT, SORD, GGA2, FU21945, FGFR4, DBRI, LMNB2, ADCYAP1, NR4AI. LIM, AGC1. FDX1, 
FU20244, ZNF24, DCLREIB, IL23A, EIF2S1 , INCENP, FU21820, ZNF264, KIAA0964, CASP8, ORC2L, CHAC. 
TNFRSF13B, MOST2, ABCB9, D103, RABL2A. FAIM. DCT, CLCAl.TRIM29.GK, GNA14, TDPGD, FU20186, RAD54L. 
SSX3, FU10193, HT010. HEC, KJR2DL5, CASQ2, TRA@. ZNF335, ING3, HSPC055, ITIH2, BUB I, MADCAM1, AXOT, 
KIAA0295. RPL17, NRXN1 , P2RX5, GASC1 , NUP210, ZNF236, RAD21, ANKTM1, EDNRA, HSPD1. COR02B, NY-REN- 
58, FKBP1 B, AQP8, K1AA0922, SNRPA1. ARJH2, ASGR2, C6orf35, IL1 RN. SLC38A3, NFYC, CACNG4. SEZ6L, GLP1 R, 
NUF1P1, G2AN, FU13949, FABP7, S100A1, TRJM36, LOC93408, API5, PADI3. TADA3L, EPN2, TNFSF4. MIP. RIPK2, 
F5, KCNJ3, HADHA, MS4AI, NEK3, KIAA0275, DTR, MNAT1, ZNF223, FNTA, NRCAM, POLG2, ADH6, CAP2, KCNJ5, 
SFRPI , APOBEC3C, IL7R, PI25. UGCGL2, ASIC4, AMFR, HSN44A4A, RAB5A. OXCT, RAB3GAP, D6S1 101, OTOR, 
LTBP1, R1N1, LDB1, PRKAB2, K1AA1006, PLK, PRO2000. MOCSl, RGNEF, PDZ-GEF1, IN A. MASP2, RSC1A1, RoXaN. 
CLDN6, HSAJ2425, KI AA0469, ING4, REM, Kl AA0092, SKP2, OGT, CBL, K1AA1 240. QKJ, ETFDH, PPP2R1 B, MDS03 1 . 
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CED-6, SLC1 1A2, GPX5, CRKL, PC4. FU10858. APOC4, CUGBP1 , REG1 B, DKFZP564B147, Cl4orfl04, PAX4, TRA@, 
RECQL5, ENG. CDC2L1 . FU22087, HYA22, DEFA4, GIOT-3, ASPM. ANK3, TNFAIP2, SLIT2, WBSCR20B, EIF5A, 
PTHLH, ATPW, CASP8AP2, HSPB3, RPS4Y, UNC84A, FU20624, CHST5, STARD5. SSX2, 1L22. TAF1 B. FEM 1 B, 
KCNAI , GPRI 5, Clorf34, CG1-07. WDR8, SLA, HGC6.2, GRIN1, CXorfb, KJAA1034. EDG4. CUL4B, CSPG3, TFEB, 
5 P164RHOGEF, FIJI 3 105, CENPE, APP, MYL6, FU23441, PON1, ENDOG, SERP1NC1, PGRMCl, TUBB5, CHRD, PAK6, 
FU20045, PELP1. FU 12735, DXS542, SH2D1 A, PR01728, HOXA6. NEUROD4, CGI-100, FU 13386, AND-1 , TBL3, 
GZMM, FU90005, FGFR1, LOC5I231, FNBP1, PI 1, PPPIR15A, VDR. CPSF6, S164, C20orfl4, K1AA0217, SGT, 
KIAA0332, DKFZP586E1923, FU10884, MCF2, MAP4, AAK.1, HS3ST3A1, LOC90806, ALDH3A2, MUFI, NCKAP1, 
FU10618. LILRB3, GAGE5. TMEM1, CD6, ADAM22, BM039, NEF3, ITCH. PPP2R2B, PLG, SNAPC1, DXS9879E, MPDZ, 
1 0 CDK3. CD209L, SLC21A9, SHB, Rabll-F!P2, MAP4K5, DGKE, MTMR3, KCNIC5. CLCNKA, SGCE, FUI0565, MCM7, 

AK5, NCR3, SERP1NB4, TPST1 , alpha4GnT, NPEPL1 1 PRLR, MPHOSPH9, IL1 8 RAP, PMSCL1 . HS322B1 A, TCF2, TPD52, 
HIVEP2, KRTHB5, KRTAP1-1. DMD, C10ORF6, AGC1, FU23436, PTK7, COL9AI, CGI-01, EPHB6, AVIL, LOC54550, 
NASP, OAZIN. SERPINA6, GPR44, VCY, DIAPH2, 384D8-2. MAPK1 1 , GALNT4, PTGES2. WNT2B, STX6. STK1 7A, 
PPF1A1. CALCA. CCNA2, DOC2B, NID, BAZ2A, WNT10B, FBXW1B, SPRR3, MINK, B3GNT4, CDK6, BHMT, SRPK2, 
1 5 PGCP. CNK, SSB, CDC6, GART, DLX2, PLEK, PTPN7, UBQLN3, 1FI44, TCOF1. FGF16. COPEB, SOCS4, FU1 1222, 

MRPL12, WDR9, DKFZP434G2226, CLECS F9, NCR3, GPR49, EP400, DKFZP586M0622, PCDHA9, C1QTNF3. STAB1, 
PRKDC. BEX1, FZD9, CAPN7, BCR. FU1 1577, IOL@. ARR3, PTHLH, AP4SI, ABCG5. SNTG 1 , CRTAC I , ZNF335, 
FU10979, HSU84971 , POLI, KIAA0643, DKFZp434!19l6, PPIG, TRG<g, MAPK12. INGI L. HIF3A. CDX4, CYHRl , 
TRAP100, UCHL5. CLOCK, SLC17A7, HFL-EDDG1, ATF7, FU20105, HRH4, FALZ, SLC23A1 , NRF1, BTN2A2, 
20 FU20581 , DKFZP761 HI 710, FU10376, GLRA3. C20orf30, C4orf6, ELK4, PLCG1 , CNR1 , KNSL5, K1DINS220, tNG4, 
PPFIBP1, SGSH, PRKAR1B, UBE4B. INSL3. DKFZP434F1735, MTMR8, KRTHA2, MPHOSPH9. SQLE, OGG1, OSMR, 
AFM, HSPBP1. VGF, HCG1X, U1SNRNPBP, FU23447, FU10057, SPRR2B, GR1K3, MARK4, WIZ, CORT, MGEA6, 
BMP7, FU10648, BRAP, DKFZP547E1010,C21orf59, STK6, KLK2, GRIN1. HOXB7. SMURF1. PCDH16, BCL1 1 A, 
SPPH1, FUI2838. SSR3, KIAA0940, P2RY2, HSU84971, ZNF134, CNTNAP2, ADAM23, MAGEA6, SPAG6, 
25 DKFZp761P1010, DTNB, CHAFIB, MLL, DGCR8, MGC3I01. SENP3, FU12331, LATS1. IPP, FXYD2, FU23360, 

FU20898,LUC7A,NDRG4,LIN-28,CXorfl5, FU1 3910, ELK1 , MGC4294.TBLI Y,FU 12985, B7H2, FU1 3693, FU 10945, 
FU203 1 3. DKFZP566C0424, 1GHM, TPS1 , GFAP, PEXI , NEU3, FU10719, NF1C, GTS El , S1AT7D, PDYN. SELPLG, 
B7H2, PIGO, SCNN1D, NMBR, NCAM2, YWHAE, SIPI, FU14084, PROZ, ATF2, PPM IF, INSM1,CABP5, ZNF124, 
SP1 1 0, SPTA1 , MGC2776, BMP8, GAL, SCA7, FU1 1 850, FCGR2B. PROSC. PDE4D, MGC 1 1 335, AKAP3, CARF, DKKL1 - 
30 pending, UGT1A1, SHANK2, LSS, GUCY2F, RANBP3, SLC16A7, PIP5K1A. SCAMP-4, LOC92579, SLC7A8, CR2, 

FU20707, FU21 106, MADH5, CPS1, COL14A1. PROL3, CUL2, CHAFl A, OAS2, SOX10, MFAP4, TCFL4, FU12618, 
SUSP1, MAGEA9, K1AA0322, SLC19A3, AKAP1 1. USP7. DCM, KIAA0616. BC008967, OR7C2, CACNG3, PELI2. 
FU14050, DMPK, FU23071, CCL14. 1GHM, BM039, GASC1, BIRC4, MGC5601, KCNK10, SLC22A8, MGC14817, 
GRCC8, LARGE. ZDHHCI 1 , ANXA13, FU14107, FUI0246, CI lorf5, POLA2, SILV, PARD3, LW-t , CCL1 3, CLCA2, 
35 MEl, RAD51C, SSTR3, STK12. ADAMTS2, MRPS12, SMCY, TUBA4, KIAA0794, CCL1 1, WFDC1, TRY6, MAP2K2, 
ACOX1, KIAA0874, Clorfl6, NRG1, RCN2, CLDN18, MYL3. FU13150. LNPEP. SLC25A21, PDE10A, STAG3, TNNI3. 
CHC1 , MAP3K7, OSRF, HMX1 , HRG, FU1 1 292, PAL, KIAA1 659, VARS2, HSRTSBETA, IL5RA, CYP3A4, FU23556. 
MAPK4, C16orf3, GPD2, HOXA3, MMP7, FU10786, C6.1A, K1AA0892, PCDH1 1 Y, TRB@, METL, PRKAA2, ZNF76, 
FTSJ1. FU90130. FLT3, GNAOl, SCNN1G, TAF9L, PRV1, SNX13, CENPJ, CNNM1, FTCD, NEK1. FU1 1336, FU 14803, 
40 C9orfl6, HIP1. PP1F, GS3955, NFATC3, DOK1, ROPN1, MAGEC1, HGF, PRLR. CTSL2, NKTR, SAA2, HOXD1 1, PROX1, 
MAP3K12, MORF. FU10619. SULT2A1, ERF, DKFZP586A0522, KCNQ2, KIAA1387, DFFB, MGC4172, MOCS3, ITGB3. 
PIB5PA, ZNF1 17, KCNA4, KIAA0999, HFE, CYP2A6, A2BP1. RASGRP2, AMELY, GABRG3. 1TGA8, DUSP3. PTGSl, 
K1AA0748, CACNAIG, CENPC1, POT1, COL6A1, ST7, FU13052, MS4A12. DLG5, TECTA. ETV5. HEY1, NECL1, 
DICER1, ALOXI 2P2, K1AA1025. FURIN, WISP2, CSDA, ALDH1A2, USP19, TRG@, SFRS7, CDX2, MRPS31, NSAP1, 
45 CUL4B, ABCC2, IQGAP1 , WHSC1 LI , ALCAM, SERPINB10, MDS028, KOC1 . ELF2, DKFZP434A 1 022, GPM6B, C2GNT3, 
CYLC1, FUl 1506, CEBPA, LIMK1, CPR2, CLTB, TNR, PLA2G3, GPR30, APOL3, TSKS, HCGIV-6, KCNJ2, MGC5347, 
MAPI A. PPARD, TMPO, LOC63923, CYP2E1, RYK, PRKAR1B, FUl 1336. FU10748. PR02958, CHN2, CELSR1 . LCN1 , 
SLC15A2, USPS, ZFR, CYB5-M, SLC27A5, MJD, KJAA1096, HTR2C, NACA, APC, ELK4, JM I , KCNAB1 , GDF2, ST7L, 
TGT, AMY1A, ESR1 , TLX1 , TBX1 , KIAA0967, K1AA0146, C1QR1, ARHGDIG. KCNIP2, HDAC6, MTHFR, NTRK3, 
50 HAVCR1 , FU22269, PLXNB1. CRACC, EGR4, PMS2L6, POGZ, FU21 148, FU20359, B4GALTI, K1AA1 354, CSF3, 

SLC17A6, PAK2, ZF, CLECSF6, FU2I120, ZAP3, FU20127, VAMP1. DCLRE1C, DRJL2. FUl 1608, SFTPC, GABPB2, 
ICAM1, PRO2405. TC10, XEDAR, CART, L3MBTL, PMS2L3. R32 1 84_3, TCL1 A MIP-T3, FU14639, PLGL, HPGD, 
MERTK, EIF3S6. PPYR1 , RPE, GLS, VAV2, TFAM, SLC6A1, RORA, PLVAP, PCDHB6, HDAC7A, MGC10731, ARTN, 
HAOl , POU4F3. KCNJ4, ATP9B, F10. LSS, MPP6, TG1F2, 1TGA6, KIAA0682, NUDT13, MGC4293, DKFZP564O0523, 
55 PACRG. ACLY. FU14627. OCM, SLC4A5. HNRPF, KRTHAl. FU21940, KIAA0632. SSX3. TNFRSF9, C22orfl9, 

SLC1 9A1 , LSR7, ZFP36L1 , SLIT3, DIP1 3B, C20orf27, ARHGEF2. EST-YD1 , PROL5. RAB3B, LAMB4, PPP2R5B, CRYGD, 
TGM5, ADAM22, AGMAT, PKNOX 1 . DSC 1 , TOP 1 , TU3 A. CACNAIG. IDUA, LTBP4, MYRIP, ABL1MI, CALD1, ZNF46. 
CDKN2C. FU20958, RPS8, MAGEBl, KIAA0683, RHAG, BLu. TFF2, XPNPEP2, TYR, FAP48. NCYM, HIF3A. MBNL, 
LRP16, PLXNC1, LOC51 145, C21orf2. ARHGAP8, FU32069, FGFR2, NICE-4, PRKWNKI, LOC65243, DIOl, MDM2, 
60 PRDM 13, CA-1 1 . PSK, TNFSF15, OPRM 1 , HSPC048, SPN, NBS1 , B1RC4, CDC27, HRH2, TRIO, CACNA1 1, TFR2, HAN 1 1 , 
NEUROD6, CADPS, MGC 1 2386, ORC5L, TNXB, F2R, PR02831, CDH18, FUl 1106, DBP. PAX8, DLGI, CDC25A, 
CEGF3, FUl 0921, HRH4, FU20456, IL12B, CACNAIF, E2F5, PRPI7, LGALS8, MGC3771, SLC6A3, RAC2, KJAA0286. 
MGC12488, NROBI , AD7C-NTP, IGL@, TULPI, PSMD1 1, COL13A1, UBE3B, FU20401, AKAP1. CRTL1 , SPF45, 
FU 10895. CCL1 3. COLI6AI, CHIA, RAMP2, SSTR1 , FYB, TXNDC4, SCAM-l, DYRK1 A, K1R3DL2, CNK2, Di-Ras2, 
65 MCCC2, KRTAP2-4, K1AA0523, IGHM, ODF2, RXRA, GABRA2. CLST1 1240, POLR2A, SRY, TAS2R7, BLR1, 
DKFZP586H2123, FU21007, SPON1, ENIGMA, KJAA0140. RPL5. DESC1. DNAJC9, PTK9, MGC 107 15, SNCA, 
CEZANNE, TBCE. HOOK1, COVA1, C21orf62, AGXT2L1, SLC24A1, SYCP2, C17orfl A, OR5V1, HCN2, KLF12, AIM 1 L, 
LOC5I336, PRC 1 7, ITGB3, PR01992, POMC, PRO0149. B3GAT3, L3MBTL, APG-1 , Cl2orf2, MOX2, ARHGAP1 1 A, 
ATP5G2, HLA«DOA, GPC4, LOC57406, COL2AI, GABPA, SCN4A, RBP4, PHF7, GRID2, OSBPL7, MRPL9, MYH2, TFPI, 
70 FU10159, 1PF1 , IL20RA, THRA, LOX, CMAH, KIAA06I6, CYPl Al, MADH5, FU40021 , FU20069, FBX022, GABRB3, 
CYP2D6, TNRC4, FU22582, NR2C1, PK428, CBFA2T2, KCNK13, DCT, KCNG1, FUl 0648, CENTB1, ADAR3, HTNI, 
PDCD1, TRIP, EFNB1. TFDP2, ATP2B2, TNFRSF7, MRPU, PTP4A3, SIGLEC8, PPP3CC, ENTPD5, BAG5, FU20047, 
GLI2, CCL21 , EPN1 , TONDU, RAP2B, CGI-72. ZNF384, C20orf42, MEF2C, RAB28. TAF1C, USPI8, GPR42, HTR2A, 
PDE4D1P, DKFZP564C196, TXK, H2AFJ, FU20623, GPM6A, FOXJ1, MGC29761, IOHM. RAII5,CSTFl. KIAA0800. 
75 CSH1 , KRT20, RAD51, TAF7L, FU10849. PTK9, RGS1 1 , CDH20, FU20034, RFRP, FOXD2. HSA9761 , PQBP1, DGCR6L, 
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FUI 1 132, OR2W1, CRYBAI, LMOD1, PDPK1, GPR56, KIAA0296, SERPINB13, KLHL5, ZNF79, BCDOl. PSORT, 
EPHA7, DKFZP434J046, PRO0800, SV2B, C!2orf3, SGCA, BMX, MHC2TA, RAD51LI, CYB5-M. VIL2, FNBP2, LEC3, 
RBM9, BRALl, NGFR, DDX34, MAPK8IP2, ANKTMl, DDEF1, ARL7, STK18, AQP4, MDM2. SYNE-1, FOX03A, 
TNNT2, TITF1, ZIC3, PPBP, FU12542, SLC18A1, IGKC, HFE, PRO0038, NPPA, IL-17RC, CXCR3, DOM3Z, GADD45A, 
GL012, CNOT2, TOB2, TFDPI , FU21617, MTRF1, APBA2, TTS-2.2, CNOT4, F9, PR02133, CRABP1, CACNGl, IGFBP5, 
CTNND2, DKFZP564D166, MYT2, EVI5, HYA22, CHK, HSPC073, RRBP1, FOSL2. FU21302. MGC2889, PRKCLi, TSPY, 
MGl, NDUFA5, IL1 RN, CRH f CXCLII, MYH8, PURG, SLC7AI, KIAA0953, ELAVL2, SP100, KIAA0675, MLLT4, 
ZNF198, CD38. BHLHB2, LLTl, FU1O210, PMS2L9, SOCS2, LIN7A, HOXA7, FU10661, ELAC2, CYP3A4, P2RX2, 
MAPK81P3, ADAM28, NPR3, DEF6, UTRN, PHC3, FBN1, DKFZP566K0524, ZNF132, OR2J2, GJA8, PSIP2, EDI, PP2447, 
WSX1, LCP1, MAP2K3, KLF12, TFP1, BTN3A1, GCM2, FMR2, DDX3, PR01768, KIAA1641 , HEMIC, SLC8A1, LA LB A, 
RBAF600, FU10572, MSRI, KPNA4, CIAS1, MEP1B, NR4A2, PKNOX1, GLP1R, FOXP3, (JJ222E1 3.1 , KIAA0471 , KERA, 
COL4A3, NPTXR, KIAA0447, ARHGDIA, ACACB, K1AA0847, CASP2, BRIP1 , LRP8, IGL@, PCTK2, TFR2. PLA2G5, 
HSPC056, IL16, FU12178. TBXI, KCNJ13, WT1, PRKACG, DKFZp547G 1 83 , MY03A, DSC 2, ANAPC2, ALDH1B1, 
CD1B. MGC14433, GPHN, IGHM, GUCY1 A2, HPSE2, GHRH, BAGE, CYP2E1 , GTSE1, MSCP, ADAM 8, PAPOLG. CGI- 
14, S1RPB1, RGN, PGGT1B, ELL, RRP4, APOL2, POU3F1, JAM1, SYP, SERPINI1, FUI 2595, NRG2, PDE3B, H1RA, 
DDX9, LTBP4, FUI 1 783, GABARAPL3, DRD3, XP5, FU20190, TRPC6, ADRA1 A, DSPG3, KIAA0564, KPNB2, 
DKFZP564O0523, UGT2B15, AP4EI, RGS7, ZNF10, PIW1L2, HLF, CYP4F2, INVS, ITSN1, FCGR3B, ARF4L, REL, 
RGS20, EPOR, FU21 168, MSTP9, ULK1 , NRF1, T1GD6, GPR88, DUOX2, GPS, SSB3, FSHPRH1 , RHOBTB3, C1QBP, 
CDSN, FSBP, CFDP1, ELK3, TUBD1, KIRREL. BAAT, CEP2, GGA2, KIAA0874, CRB1, FUU726, P2Y10, PCDH1 1 Y, 
GPM6B, FU10715, TRJM9, FCAR, FGF22, FU13993, DIM1 , GIPC2, KIAA0626, SNIP!, Gene Symbol, LARS, C15orfl5, 
KIAA0783, MGC2714, FU10036, HSPC154, FU10486, FU30596, FKBP5, SERF1 A, REC14, OCLN, FU21924, LOC51249, 
FRSB, AD034, CCNB 1 , FAM3B, MLL3, IBA2, SEPPl,C14orf3l, HMGB1, C14orf35, MGC4308, FU 10407, GRCC8, 
C20orfl29, FU20060, Spir-1, LAN PL, RBBP7, KPNA4, FUI0486, MKKS. SNX5, SART3, FUI4494, FU21087, HOXB9, 
NUCKS, PPP4R2, C14orf47, EHF, MGC14439, LOC55871, AP1S2, TRNT1, FU25059. MGC10198, K1AA2024, KIAA1309, 
HSPC014, LAPTM4A, GPR54, ARL6IP2. DNMT3A, DKFZP564B1023, KJAA01 14, ATF7IP, HSPCB, HDAC3, FU39370, 
FU20093, PP2447, LOC139231, MGC4I917, MGC20262, CSRP2BP, LOCS1 193, GRP58. HEY2, ANLN, UBL5, CDCA7, 
KIAA1 321 , KIAAl 323, UHRF1, HDAC3, KIAAl 91 1 , FU001 66, K1AA1453, DKFZP434A0I 3 1 , NY-BR-1 , 37865.00, Rpol -2, 
MGC5306, BOC, FU25804, FUI4728, BDP1, PSCD3, AF15Q14, HDCMA18P, PRO2000, LOCI52S18.GART, TRIP1N, 
DKFZp3l3A2432, PSA, PGGTlB, MGC4832, LOC85028, FIGNL1, PECR, CBFA2T2, HOXC9, CPSF2, SLC25A19, 
C20orf45, FU32915, ZNF367, PANK1 , LOC131 118, FU14909, MGEA5, TRIM46, RpoI-2, DKFZP434C245, AKAP10, 
CDCA1, H326, DKFZp761 A078, FU20333, NEDD1, AUTL1, TRAP25, KiAAl 143, GPHN, LARS, DKFZP434D193, 
FANCD2, PRO2000, DlCFZp3 1 3 A2432, FU12439, MKJ67IP, LOC115004, FUI 1220, MCMIO, MRPL1, NDUFS8, PHF5A, 
OAZIN, LOC92345, K1AA1708. KIAAl 982. MGC2628, PXMP4, KIAAl 804, ELYS, HNRPD, ZNF6, MRPL42, KIAAl 287, 
TRUB1, TOMM22, FU25070, SPPHI, ZIC2, C6. 1 A, CGI-77, MGC33864, MKI67IP. TUBE, VIK, MGC14798, FU20354, 
KJAA0I40, GTF2H3, FU12787, DLD, ARIH2, KJAA2023, KJAA0864, CDC23, MGC13096, TRF4-2, OSBPL6, MNAB, 
ROD1, USHIC, MGC16372, FU20333, FZD8, MCMIO, FU23445, WDR4, OFD1 , AK2, REVIL, COQ3, ASCL2, EGI, 
TReP-132, CAB561 84, FU13081, HELLS, FU10378, C20orfl 61 , EPHA8, DTNA, HSU53209, NAGS, LOC84524, 
LOC91 120, LZKl, DKFZP434I092, FU1443I, FU20354, HS6ST2, FU20333, KIAA0140, FU23476, C14orf31, LOC55871, 
C14orn5,C20orf42, TBXI, CRMP5, Jade-l , CASPR4, FUI 1132, DKFZp5470146, MRPL50, LOC51 193, FUT10, FU30655, 
SELB, KJAAI524. FU14813, FU38608, TRJM7. SYT12. FANCD2, FU25078, FUI 1294, K1AA1357, STRIN, pknbeta, 
NSD1, DKFZP434B1727, BCRP2, FKSG14, EIF3S9, MGC2744, KIAA1595, C14oifl06, LOC144455, KLK12, KIAA1374, 
BCoR, GABRB3, TIMM22. FU25416, BRUNO L5, MGC24665, ARX, DKFZP434K0427, KIAAl 91 5, C7orfl I , MtFMT, 
FU21439, MAP2K7, DKFZp434H21 II, ARFGEF2, PR01489, PTPN1, MGC13204, FU23322, MGC16386, MGC45866, 
FU30626, CML66. ZNF295, ARL8, LOCI 15106, MGC12466, SNX5. FU22344, MGC10850, AKT2, NCOA5, KIAA1713, 
MGA, FU20032, RNPC2, DKFZP434E2318. MLL3, SYNPR, FU10989, C2orf7, LOCI 15827, LOC91 862, MGC13016, USFI , 
DGKZ, LAMA3, DKFZp564B0769, A2BPI. KIAA1560, LOC221002, BG1, ENT4, RNF3, CHAC, ICAM2, FU10493, 
EIF3S6, TRA@, FU25604, TUBGCP6, GATA5, PGSI, HT014, C20orf6, NAV2, KIAA1357. GABRB3, FU10378, HSPC150, 
ADCY3, BIGM103, MGC3067, APC10, BOC, LOC120379, KPNA4, FKBP7, C14orf50, FU22557, NUDT10, DDX17, 
FU22729, TA-NFKBH, FU10785, FU32745, WHIP, CTLA4. MRPL30, MRPS25, FU10498, CDOl, FTCD, SPTB, 
KIAAI323, DKFZp761F01 18, MGC2452, AKAPI3, LMLN, LOCI 12840, FUT10, TP73, PDCD7, KIAA1274, Tenr, CRR9, 
KIS, SPG7, HSFY, LOC92691, POLH, SMC6, MSCP, FU10378, DKFZp434FI8l9, CSTF3, CPNE4, HINT3, HSPCA, 
KIAA0982, P53AIP1, ING5, DKFZp434D0513, STI2, SEC14L2, BCL1 I A, EPI64, FU25530, GPR49, IRA1, ARHGEF7, 
USHIC,RBM6,DSCR8,FU35863,NXPH1,MGC46719,MGC10981, ZNF398, CYBB, MGC4170, KRTAP9-4. NCOA6IP, 
HCAP-G, DMRT2,COR01A,Cl2orf22, MLL, K1AA1753, DMRT3, KJAA1557, RAD18, FTCD, EIF2C2, KJF13A, 
DLL3.KRT19, TRA@, SCAND2, FU25286, ZDHHC4, SEC13L, GPR92, ZNF207, FU14600, USP2, HDAC9. 
PRKWNK3, DISPB, CENPH, MGC29667, LOC149420, PRPF18,CHD2, KIAA0599. MGC16824, 1RTA1, ZFP28, 
LOCU2840,KIAA14U,LOC5U94,SLC4A5, LOCI 15098, KIAAl 720, MGC40397, FU36874, NESH, TMF1, LGR6. PF1, 
MGC 16943, TUFM, HERC2, DKFZP434N151 1, FUI 2697, NLN, FU32827, CSRP2BP, RUFY2, RBM11, UBE21, YAP. 
LRPI5, CFLAR, OSBPL5, NPD007, ZIC4, OR51E2, MGC1730I, PAX6, FUI2697, MGC35366, U2AF1, TU12B1-TY, 
BAG2, SLA/LP, B1CD2, K1AA1465, DKFZp434G0522, ZNF354B, FUI 0420, DARS, KIAAl 337, DKFZP434C0826. 
KIAA1712, CDGAP, FUI 0324, ARHGEF7, DKFZp434G0625, HES6, MY050, CSNK2A1, MPHOSPH9, HDAC10, 
KCNJ16, LOC135763, EKN1, ORAOV1. FU31528, POU4F1. MGC42174, SYNGAP1, RRP40, MGC10744, FU12363, 
TTC7L1, DKFZP761N09121, ZDHHC1 1, MGC8721, IRTA2, ODAG, TRPM7, KIAAl 878, TM4-B, DKFZp76lH039, 
ADAMTS9, CG1-203, KIAAl 881, FU20003, SPPL2B, FU13386, RPC5, CTLA4, FU37034, DKFZP586N2124, 
DKFZP434D0I27, KIAAl 966, KIAAl 946, MGC20255, SPINO, FU90013, ALS2CR7, SH3GLB2, FU33962, FU23027, 
PROK1.GABPBI, MIPOLl,MCM6,BAP29,VlTl, SYNGAP1, PEL11, FU25477, WBPI, ROCKI, ABTB1, LGI4, 
WNT5B, CLDN6, FBX02, Cl8orf2, GAJ, TRIM7, FUI 3993, PEX5R, CECR6, PR, LOCI 51648, POSH, HRIHFB2072, 
SOX7, LOC139231. DKFZP434K0410, SOX6, CHPTl, NUP133, PSG5, FU22688, YME1L1, DKFZp313A2432, 
MUSI, FBXOS. KIAAI444, BCR, EPB41L5, RNPC2, HTAT1P2, KIAA0436, NSl-BP, LENG3.GLS. MIXL1, WDR9, 
DKFZP586M0122, KNSL5, G3BP, KCNJ2. PTBPI. DKFZp434N14l5, SEMA6D, LOC63929, PTER, NAVI, FU39441, 
MIDORI. MGC1 4793, BAT4, FUI 2987, SEPP1. NYD-SP17, ZnTL2. FU35725, C6orn2, GSBS, MGC40157, KJAA1458, 
AUTS2, FBXU2, KIAA14S3, C20orf44, MGC20533, PGS1, FU11053, MRPS1Q,EML4 ( MGC 14793, POLR3K, RINZF, 
MOBP, FU12298, PIST, D6LOSF, MOC2629, NPHP1, DKFZP434DI42I. AKNTL2, NDUFBt, DKFZP667C165, 
FKSG42, HAL, WBSCR22, MRPS25, DHCR24, LY6G6D, LCHN, DKFZp76l A052, DKFZP434G156, TBX3, FU2I839, 
BRUN0U. NYD-TSPG, WAAI706.5TYX, MMD, LOCI 13321, TWM35, ZNFNIA4, DKFZP586B03I9, KJAAI798, 
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FU30829.FU 14281, DKFZP586G1517, MGC2629. DDHD1, CRSP6, FU1I252, TRB@, GNAS, FU12975, KIAAI458, 
COL12A1, SPINO.K1AA0478, FU2008S, SOX7, DRF1, TBDN100, BHMT2, ZFP91, SRMS, MGC15523, KIAAI919, 
FU23816, FU1I125, C20orfl51, STK31, RTBDN, FKSG83, GLI4, FU22548, KIAA1912, C20orf42, TRIPIN, NDUFS7, 
m HSPCUfl, MOOaMOO, YR-29, SCDOF.B.KCNJ15, CLLD8, ZDHHCfi. MOC10734, MOCM2 1 «• DKFZp547B052, 
$ DBraiI8,MaC24030 t KIAA104A, FUlOtJft, ACM ID, IIM, TOM7, MOC3165, TUMI, WHIP, LZKl, LOCOOMO, 

IRTA2,KIAA1360,NXF2 ( KIAA1317, DXYS155E, FU319H, HSPC154, HI9, BAP29, PRXRA, PLAC3, LQC58486, 
FABP4, LOC130617. JAM3, LOC57019, TF, USP24, FU20222, FU20354, K1AA1836, MGC3040, SAC2, BARHU, 
DSCAML1, STK35, KJAAI337, KJ A A 1 276, LOC 1 1 5557, FU 1 4600, ROCK1, FU38359, MGC33215, ATP9B, UBE3B, 
C7orf3, PRKWNK4, DKFZp434J0617, MAPK1 , PRKCE, K1AA2028, OBTS1. KIAA0716, OMRTC2, FU10998. 

0 FU32069, LOCI 15330, FANCA, DGCR14, KIAA 1 337. FU23577, FU22761, FU35155. FU22329. FU14427, 

FU20557,FU20321,ROCKI, PPP2R2C, BCoR, FU00058, LAMA1 , FU20898, FU3I606, PCDHB4, DKPZp547M 1 09, 
CLASP2, KCNQ5, LOC51240, FKSG79, OAZIN, FIJI 3576, MGC4473, LACRT, NAG73, HSA251708, HSJ001348, 
TRA@, DKFZP434A236, MNAB. HAP1, MGC24995, DKFZP566C134, KIAA1 501. MGC 13090, C8orn3, GGTL3, 
FU35757, CRYPTIC, CI4orf35, K1AA2015, FU12303, LOC92033, FU20171, FU31340, TMPRSS2. R1P60. ZNF272, 

5 FLJ20641,RP4-622L5,CENTA2, C20orf64, HHLA2, DPMI, PRKCL2, GNG2, and RTN4IP1 . 



Table 7A 

Genes Up Regulated in Un-Passaged Tumorigenic vs. HSC 

JCRT19, C3, GOLPH2, CR1P1 , PTGIS, BF, RA13, CA12, S100A8, PPL, TUBB, CXADR, NNMT, ITGB5, COL3A1, FN1 , CIS, 

0 CD14, EFEMP1, COL1 A2, GJAI, FU20151, LGALS3, TACSTD2, LGALS1, FN1, MUC16, COL1 A2, KRT7, RARRES1, 
DSP, ID4, HRASLS3, S100A1 1, CYR61, SLPI, C4A, LGMN, S100A9, SERPINB2, MAFB, COBL, WTI, TGFB1, SPUVE, 
CD24. DKFZp564A176, ANXA2P2, SI00A10, ROR1, EGFL6, FNI, MUCI, ALDH1 A3, PARVA, CDH3, FN1, TIMP1, MGP, 
AGR2, KRT1 8, DC 1 2, CHI3L1 , CD24, FU20273, ID3. HI 1, HLA-DQA1, ANXA2, SERPIN A3, RAB3 1 , ANXA2, RAB3 1 , 
EMSI , FERIL3, KIAA1 199, CX3CRt, FU1 1619, KLK1 1 , CD24, TIMP2, CCNDI , LOC51760, FLRT2, HP, GPRC5B, 

5 IL13RA2, APOE, GAS1, PPIC, MAPK13, KJAA0882, APM2, PLAT. MYL9, MY06, COL3A1, ANXA2. RAB31, 1GHG3, 
PMP22, FAT, S100A8, MARCO, PTPRK, PTPRF, CD163. DF, C4B, COL1 Al , IGKC, TFF1 , TGM2, CTSL, ITGB5. 
G A LN AC4S-6ST, IF. RARRES2, ADAM9, VCAM1,CD9, ID4, APOCI, PDEF, VIL2, GRIA2, RIG, MET, GNGI2, CD163, 
FU22662, CAV1 , PRG4, CDH1 1 , IF127, TM4SF1 , NNMT, DUSP4, THBS2, COL6A1 . FGFR2, TNXB, A2M, UPK1B, BCHE, 
IFI30, MAF, KIAA0752, TPD52L1, KRT8. FXYD3, CKAP4, ALDH1A2, ANXA8, BCMP1 , ALDH8A1 , ASS, EFEMP1 , LTF, 

0 FU2015t,T1A-2, SELENBP1 , CTSH, GPR64, TJP1, RARRESl, SYN47, PDGFRA, PRSSl 1, AQP1, COL5A2, EPHA2, 

ITSNt. SULF1, PTPN3, LGALS2, OGN, CTSB. IER3. FMOl, SNCAIP, PPAP2A, MGC2376, GATA6, IL1R1, CD1C. MEIS2, 
TACC2, C1R, AQP3, LR8, SLC7A8, S100A6, ATIPI, MIG2, TNXB, MAOB, DCAMKL1, DPP7, ANXA3, RBP4, ziziminl, 
CH13L1, FARP1, CLMN, BNC, HCA1 12, CSPG2, CD24. EMSI, CEBPD. ILI 3RA1. RJL, COL4A5, KDELR3, CAP2, MAF, 
TFPI2, DOC1, CSPG2, LGI2, Z39IG. CYP1B1, CAVI, ALP, ERBB2, LAMA4, CSPG2, LOCI 13146, LAMP3, ARGBP2, 

5 MNDA, DKFZp564ll922, CAV2, MARCKS, TPM2, LOC92689, GFPT1, N33, SECTM1, WFDC2, CLU. ROR1, TST, EFS2, 
GUK1 , C1QB, CPE, CRYAB, TSTA3, CALB2, EGFR-RS, PPAP2A, PTPRG, SAT, TFAP2C, C2, RCP, SULF1 , SFN, - 
LAMB1, IL13RA1, PHT2, BMPR1A, LIM, FLNC, N33, ST5, CSRP2, FU23091, PAPSS2, IGSF4, TNFRSF6, STEAP, 
BACE2, SERPINB7, CALU, PDXK, PPIC, TACC2, CLDN4, GPNMB, RJN2, KIAA0599, LUM. KIAA0790. CARDI0, MVP, 
PDGFRL, RRAS2, KJAA1078, AKAPI2, ARHE, RNASE6, BLAME, TM4SF1, TIA-2, K1AA0869, MPZL1, N1D2, DDR), 

□ DUSP4, LAMAS, SGCE, UBD, LGALS3BP, ENPP2, SGSH, COPE, KRT5. SEMA3C, IGKC, COX5B, ELOVL1. SI00A14, 
APEG1, ALOX5. TM4SF6, LMNA, DSTN, RAB20, DNAJB2, TYROBP, LJPKIB. KDR. P4HB, FU1 1856, ClorD4, ADM, 
NR2F2. PLXNB2. ITPR3, S100B, SOX9, DCN, EPS8, EFA6R, ZFPM2, PPF1BP2, SERPINF I , NQOl , NMA, AADAC, 
COL6A2, SERPnMEl , MT1 X, MGC3047, NCKAP1 , DDR1 , TLEl , EPN3, TBX3, CDS1 , HSPB1 , DPP4, CTSB, NEOl , 
TMEM8, NFIB, FKBP2, TNFRSF1 IB, FGR, FMOD, P4HA2, TNFRSF12A, ERBB3, NQOl. LAMC1, PR01489, IGFBP3, 

5 MYOIC, KIAA1026, SLC6A8, PDE4A, HML2, FU21562, C8FW, MS4A6A, KCNK.1, C3AR1, AK1, MT2A, KLK10, 

KIAA0429, IGSF3, ARNT2, DCN, C12orf5, CD24, C4.4A, SFN, CRABP2, VIL2, CLECSF6, HCK, SIX2, TSSC3, CCR7, 
GFPT2, TUBB-5, ENAH, SLCI6A4, CI 1orf9, FU20761. SARI, GPC1, MYOID, RGS16, DCN, MT1L, PCDHA12, SGSH, 
RHBDL2, GLUL, CKMTI, NPAS2, EMP2, DAB2, DSCR1L1, MATN2, BLVRB, PLAB, MT1G. WIT- 1, OASIS, PPP1R3C, 
NQOl , AMOTL2, TNNT1. AZGP1. PARGl; SLC7A7, COL5A2, NEDD4L, DCN, SERPINA1, DFNA5. SAMHD1 , IQGAPI , 

0 THBD, DPYS. ADAMTS5. MGC10848. NEBL, RAI2, TUFT1, KCNJ15, LIF, CD151. DAF, ILIR2, NRXN3, HK3, FCN1, 
CXCL1, CALD1, PCDH7, Clorfl3, TRD@, NFIB, VEGFC. CCL22, CD63, CTSZ, KYNU, ADFP, HRH1 , CTGF. GRJK2, 
ANG. KJAA0790, SNK, CST3, SDR1 , KIAA0703, MGC35048, ANXA9, YAP1 , ADH1B, CLDN 1 , TIP-1 , COL1 8A 1 , DOK5, 
GPRC5C, IGSF4, ABCA8, KDELR3, PPAP2C, KIAA0440, 1GF2R, VLDLR, OSBPL10, SLC12A8, NPD009, RPL37A, 
MAPT, FARP1, LAMP I, DAB2, KRTI7, SSH-3, ABCA3, PHLDA1, FBXL2, LOC114990, LOX, ALDH3BI, RIG, SDC4, 

5 CGI-38, ZFP36LI, FOLR2, DLG5, PFC, BGN. DSC3, WARS, FU21610, MGC2494, PCOLCE. FCERIG, FGF13, MD-2 
UGCG, BAG3, MAOA, CAPN2, CCR1, TRIM2. CLU, NR2F6. KIAA1598, GPR65. TRD@, PPARD, HSPA6, K1AA0436, 
DPI, GRN, ABCA1, CD59, ITGA3, NT5E, SLIT3, CDC42BPB, ZNF144, LTBP2, FERIL3, PCOLCE2, FST, CSTA, 
CLECSF6, HOMER-3, LDB2, SLC34A2, TEAD3, PMM1, EFEMP2, HN1, FU20539, TPM1, CXCL6, MPZL1 , 
DKFZP434B044, GS3955. CHST6, RPL5, IL1RL1, RIS1, SN, CDKNIA, PIGPC1 . SLC4A2, SMARCA1, GBP2, RNASE4. 

0 EFNA1. MCP, DPP4, HSPAI A, LRP10. GRN, SLC39A1, PFN2, BC-2, WNT2, FU23186, TPMI, SIAT4A, RNASE1, PLS3, 
TIMM 1 7A, DDR1 , FU20366, EFNB2. PSPHL, MEOX2, K1AA0429, SDC2, MGC1 0796, SERPINB5, CAST, MY06, CRIM 1 , 
TFPI2, NCF2, FU22531, LISCH7, SLC7A1 1, MGC1 1242, PKNOX2, RARRESl , FBP1. CLIC4, CAST. C5RI, SPR, BCL6. 
RIPX, GRN, K1AA0934, HSPB2, SPARCL1, CTSB. S100A1 IP. 1GF1. BCAR3, ASTN, RRAS2, FU21562, KIAA0992, FHL2, 
H LA- DOB, LAMB1, MAP4K4, EFEMP2, KIAA1029, PP1057. SLC7A8, TLR7, MMP15. WDR1, GHR, TJP1, PCDHGC3 

5 MMP19, ARHD, RIL, NOL3. WNT5A, RABI7, F-LAN-l, IGF I, BMPR1 A, TLR2, FTS, EPB41L1, TPMI, CDID, YKT6 ' 
GRIM 19, WARS, AXL, MIF, CLIC3, MAPKI3, SSB1. SEC61 Al. PDGFRB, IL10RA, CLTB. PCNP, SNAI2, SGCB, 
CYP39AI, FU90798, SBBI31, FZD2, AMMECR1. SOCS5, KJF1C, S100A13. CLDN7, PBX1, TJP3, RGL, FKBP1 1, GRP58. 
EIF5. 1GFBP1, FUI3612, G0S2, TNFAIPl, TIP-l, PSEN2, PPIB, DAGl, ARF4, AHNAK, LOCI 15207, PCDHGA1, MSTI R, 
SH3GLBI, SC65, MGST3. BMP2, CTSB, TMSB10. TRIM38, ITSN1, MPZL1, ARHC, KIAA1078, PLTP. CRJM1 , CI lorf24 

□ KIAA0746, MGC2376, COLECI2, BBOXl, WNT2B, HUMPPA, PAM, MAP4, FU219I8. SLC2A6, MYOIB, NFE2L1 
DXS9928E, SLC1 Al, TUBGCP2, SULT1A1, QSCN6, LOC51 159, PSK-1, CYB5R2, RAI14. LI CAM, KCNMAI, CD1 E 
HOXC6, THY I, PTOV1, EDG2, SUCLG2, AQP1, DDR1, TMEM4, EDG2, FU22833, KCNKI5, KIAA04I7, TCF2I, 
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ASML3B, HSPC163, LAMA4, APOC1, DKFZp76IF2014, SLC21 Al 1, CXCL14, FCGR2A, FU20967. MRPS12, FU131 10. 
KIAA0913,SHC1.DP1,TLE1,SLC2A10, PON2, SPAG4, ITSN1, ACTL7A, RBP1, ILIRAP,C22orf2, ATP1A1.DES, MST1, 
PHLDA1, KIAA0934, S100A2, ID4, ITGB4, CASK, SLC31 A2, C2torf97, CD86, FBX09. AP1M2, D2S448, ADCY9, 
PALMD, PTPN2 1 , TRA@, PPIB, EPB41 L4B, PNMA2, RSN, SYNGR2, SLI, FYCOI , CLTB, MGC16723, CKAP4, PLEC1 , 
FU10521, B4GALT4, ID1 , CDA08, OPTN, PTHLH, MYOl B, UM, TLR5, FU23516, CAST, CTSL2, CSF2RA, C14orf58, 
SLC7A8, TREM2, CST6, ARHN, ST14, PTPN13. SLC5A7, DUSP5, B4GALT4, DKFZp667G21 10, TWIST, SC65, PPP2R1B, 
ITGB5, KJAA1096, EVI5, RAB2, CTSD, SL1T3, KIAA0284, NPY1 R, HERPUD1 , PMM2, HSD3B1, HPIP, UNCI 19, 
KDELR2, FU10199, PLOD, GTF2IRDI, SQSTM1, BD1CRB2, WSB2, DPP3, LOXL1, SEMA5A, TMP21, CLTB, DNALI1, 
CXCL13, FZD1, CNN3, KDELR3, ADAMTS2, MD-1, TAT, FU20234, DKK1, FLJ10856, TM4SF6, K1AA01S2, FBX02, 
CLECSF12, PRSS16, KIAA0I03, UGDH, YIF1P, P8, SNTB2, GOSR2, KDELR2, D4S234E, HABP4, ANK.RD3, CCL1 8, 
TEGT, EGFR, ATIPt, EPHB3, H GSI65L151 , TCEB2, AGRN, NBL1 , FLRT3, NPAS2, SC02, MAOA, NFE2L1 , APLP2, 
MED8, LRP2, SMARCA1, TJP2,"p47, FU10055, EPS8R1, TGIF, AGRN, SEMACAP3, DSC2, FBLN2, ORMDL2, 
ADAMTS3, PTGDS, CENTG2, MMP14, SNARK, PTGER3, DPH2L1, PTPN2I, DSCRI, PP1665, PTK9, AFFX- 
HSAC07/X00351_M_at, HAMP. TOB1 , FACL3, GMPPB, CSRP2, P4HB, NPC1L1 , PIG7, VNN3, ARIC5. PODXL, ACADVL, 
GNPI, FU10261, UPLC1, SFN, PEA 1 5, MLCB, SLC3 1 A 1 , ICAM 1 , UP, SLC4A4,C1 lorf!7, PTGER3, ZFP103, CYP-M, 
HMOX1 , SLC21A9, TCN1 , SLC20A2, RBSK, WNT4, CYBB, ANXA4, DNAJC3, M1RO-2, ARHGEF4, SULT1 A3, GOLGA2, 
PTPRF, NDUFB7, TBC1D2, MSR1, COROIB, FADD, ATP6V1D, ALDOA, EPLIN, MST1, TD02, ETV2, CCR5, SERF2, 
GTPBPI , COL4A2, ASPH, ELM03, DKFZP564A241 6, BAIAP3, APLP2, PDE8A, 1FNGRI, GREB1 , ANXA2P3, CAPG, PTS, 
N33, MGC1 1256, PLA2G4C, HFE, FU90798, FLNA, LMNA, IRX5, SRPX, LOC160313, SLC33A1, CSTB. FU20152, 
ATP6V0E, HSPA1 A, KRT6A, SARI, POR, NDUFS8. CCL2, B4GALT1 , TMSB4X, FU2070 1 , ACTN 1 , 1 L4R, F5, CD5L, 
IGFBP3, ALOX5, AUH, CKAP1, CCR1. KIAA0843, UGTREL1, GAS2L1, AP1M2. RARRES3, PPGB, LY6E, GNB2, 
CTNND1 , FPR1, ALDOA. PC326, KJAA0980, PGM 3, DHCR24, PTGDS, LAMB3, ALDH7A1 , KIAA0716. TCI0, 
K1AA1096. IL1RN. CI lorf24, FDXR, SERPINB3, COL6A1 , FU20296, DTNA. IGF2R, TRIM36, FU22593, 1FITM2, ARHD, 
KIAA0220, OCRL, SDC2, KIF3B, GALNTI0, PRKAR1 A, VTHB, PSAP, PTPRO, FGF2, PCSK7, SUCLG2, ERP70, 
FLJ20254, M LP, COR02A, ILI3RA1, RGS16, MEIS3, FOLR1, LGALS8, LAD1 , TGFBR3, NDUFA3, LANO, AFAP, SGPL1, 
UBXD2, GM2A, PCDHGA10, PACSIN3, CFL1, PAM, GOLGA2, GSTM3, CREB3, C14orf92, IGL@, FU21313, SYNE-2, 
EPHX1, MRPL17, PCDHGC3, MAP3K6, DNCH1 , TM7SF1, LARGE, VRP, IL6. K.IAA1096, SARS, PSMD8, COX17, GPX4, 
SULF1, NEU1, ISGF3G, PLP2.CYR61, ATP6VI D, EIF5, FLJ20847, DKFZp761K1423, FU1 1526, EHD1, KMO, KIAA1735, 
RGS3, SDFR1 , ASM3A, FGFR2, FCGR3B, TPM4, CPE, FLOT1, CNGA1, SPHK2, FBXL7, SH3GLB1, LAMP2. EHD1 , 
PLXNB1 , VCP, SNCB, ITGAV, FU21047, STAT3, PSMC4, CALD1, DES. ALDH3A2, VDR, PAPSS2, MGC13523, ARF1, 
NDUFA2, PPAP2B, FUS 1 , ASNA 1 , TUBB4, MGC4504, RGS19IP1, ATP5H, TSTA3, Cab45, RDHI 1, ECGF1, TMEM2, 
GALE, WSB2, NSAP1 , WFS1, HSPC003, GOLGA1 , SH2D1 A, FU20986, KRT17, UNC84A, MYL6, LAMC2, FGF18, 
HS2ST1, RNPEP. TC10, FU14675, MGC3178, TM9SF1 , GALNS, SORT1, HSPC019, SULT1 A3, ENC1, RAB9A, CED-6, 
C21orf97, HFE, FUCA1, KJAA0674, EHDI, PLAUR, CETN2, TPBG. CYP27A1 , MAN1CI , PPP1R13B, ATP5J2, THBS3, 
FKBP10. YKT6, PIGO, CYP4F12, LRPAP1 , ITCH, MLF1 , ACTN4, EIF2AK3, PDE4DIP, DZIP1 , TUBB4, SEC24D, 
Kl AA0143, ITPKl , FU 131 10, AP2B1 , IFITM2. SCN8A, STS, CDC42EP4, ARPC1 A, CD2BP2, CACNG4, SULT1 A2, TAFI0, 
BRD2, TRAM, HSF2BP, UBC, ADAMTS9, AQP9, RALA, COL15AI, DYSF, LAMB2, RPL5, EHD1, CLCN3, ARF4L, 
HDLBP, NPR2, HRB, SQRDL, MIG2, NAV2, TBC1D1, TPD52L1, VTN, ARL1, CYB5, LGALS8, COPZ2, FU21916, 
FU20421 , P4HAI , TBLIX, ANGPTL2, KIAA0992, NRPI , SLC21 Al 1 , ICMT, STS, EIF5, PIP5K1C, RDS, PVRL3, PON2, 
H1G1. DLAT, LOC64I82, RNF3, ACAA1, UQCR, FLOT1, TC10, DSTN, TEAD4, RERI, TREM1, IL17R, PLCE1, SLC6A8, 
HIMAP4, PILR(ALPHA), TRIM38, TXNDC4, CTSK. DSS1, LPHH1 , SGCD, PEN-2, K1AA0527, RRAS, CD3D, LANCL2, 
P2RY6, TUBB, RACI, AAK1, LOC51762, ALOX5AP, GNB1, FKBPI1, RNASEH1, EPB41LI, GPRK5, GPI, HMCS, 
PTGER3, SSR4, FKBP9. AK3, CBLC, SGPL1 , PLCDI , MED8, ALDH3A2, IGSF6, KCNN2, HS3ST3A1 , MLCB, TRJM38, 
FCGR3A, IFI35, ABC A 1 , DKFZp564A 1 76, FSTL3, MAPKAP1, ENTPD3, FU23514, HS3ST1, IGHM, PM5, NDUFB2, 
TOMM22, ANGPTL2, KRT7, SSH-3, ELOVL1 , NPEPLI , NEDD4L, PARVA, PTK2, SEMA3E, NCBP2, KMO, QP-C, ECM2, 
ATP9A, HMOX2, SMAP, SLC9A3R1, ATP1B1. PCDH7, EDFi, OPCML, NEDD5, FU10466, CBX6, CDH6, MAN2BI , 
CYB5, SLC38A6, FU12443, ASPH, MOB, HUMNPIIY20, DC50, PSMD5, LRRF1PI, FU22160, PAFAHIB1, 
DKFZP586L1 51, BLAME, TAZ, ATP6V0B, APBA2BP, RISC, ADRA1A, PIG3, TNFRSF21 , CBFA2T1 , EML1 , EPIM, 
APOE, W1SP1 , CA12, V1L2, RAI, FAAH, ATP6V0D1 , CD97, JAG1, STX4A, Cab45. NFE2L2, PPP1 R12B, ZMPSTE24, 
KIAA0500, IL1 7BR, RRAD, PGM1, CD59, ADAM 19, NPEPPS, FJX1, GAA, SOX13, FU22638, BAIAP2, DUOX1, TGFA, 
FU20719, LMCD1, BBS4, MARCKS, GM2A, FU1 1200. MAPK3, WWP1, FU20152, SMARCA4, PSCA, MCJ, ARF4, 
SLC35A2, SKD3, CDC42EP4, SLC22A1 L, SSH-3. SMARCD3, PDLIMl, IL27w, CGM35, COX5B, LOXL2, CRK, GOLGB1, 
PSMD4, M AGED 1 , CDC42EP 1 , HSPC171. SECUL1, KIAA0265. PSEN2, XLKD1, STAB 1, FU2 1079, FBLN1, INSMI, 
FLJI0252, MPDU1, MGC3067, FU11 181, TPARL, TUL1P1, DUSP8, UBXD2. CPD, HSPA4, FU1 1807, GPR1, CTNND1, 
TNFA1P2, MAGEDI, MMP9, CKAPl. UGCGL1, SMP1, FU22678, BZRP, COX8, BDKRB1, HOXC4, , H19, NMES1, 
SMOC2, PIGPC1. TEM8, PTGFRN, FU23091, IGKC, ALS2CR9, 1MUP, MIG-6, MAL2, SPUVE, YAP1, CXCLI6, MY05B, 
KIAAI 244, PARVA, SYNE-1 , FGG, AGR2, KIAA1500, RERG. NTN4, TMPRSS3, ARHU, RHPN2, GLIS2, UGCG, SULF2., 
BOK, OGN, CLDN1, DKFZp434G171. FAD104, KJAA1165, ShrmL, PTGFRN, AD037, OSAP, LOC51760, MS4A6A, 
FU20273, MS4A6A, FU23153, NAP1 L, LRG, LOC55971 , MGC14839, FU30532, UNC5H2. FU14299, TCEA3, CTL2, 
ORF1-FL49, LOC155465, ENAH, OSR-1. SBBI31, DAGI, EDG3, PSK-1, MGC2615, ALS2CR9, DKFZP761L0424, TBX3, 
FZD4, FU20171. DKFZp761P0423, NGEF, TOBI, CtQG, DNALI1, MGC35048, GUK1, DKFZp586Cl021, KIAAI500, 
LOC83468, p25, CCL26, GNGI2, SAMHD1, ID4, B4GALT1, DKFZp434D0215, GJB2, FU14957, PRO2605, MGC13040, 
CHDH, ALDOA, FST, TEAD2, KI AA2028, FLRT3, FU3 1 842, CDKN2B, MGC 1 6028, 1RX3, TEAD1 , MGC33662, MS4A6A. 
SEMA6D, DKFZp434E232l, PKIB, PKJB, KIAA1671, FU22174, LOC128153, COTLl, SAMHDl, MGC24103, UACA, 
SELM, CGI-85, NAPIL, CAMK2D, C4orf7, BOC, MGC1 1034, DKFZP564J0863, DKFZP434H0820, PARVA, SPP2, 
FU40432, STEAP2, PDGFA. BACE2, FU14834, LOC55971, ANGPTL1, MFI2, KIAA1337, WNT7B, IPP, DKFZp547D065, 
MGC39325, CTL2, SAMHDl, LNX, MGC26963. KIAA1324, MGC16212, K1AA192I, ALS2CR9, CXCL14, SPPL2A, 
FUI4525, ENPP5, MGC29643.TCF21, ECGFI, PCDHB14, CFL2, GRP58, TGFBR3, DKFZp434F2322, FU22474, RCP, 
KJAA1866, MGC10974, PHLDA1, MGC 12335, SYTL2. LOC51242. PCDHA10, KJAA1 145, KLFI5, TMEPAI, GRIA2, 
LOC92689, SIPU H19, FAD104, CI lorn5, MGC39329. MAFB, BCAR1, RDHL, C14orf50, DRAPC1, RORC, MYEOV, 
GPR92, DUSP16, GFRA3, ZD52F10, FUI4735. LOC113026, FU20048, CLDN1 1 , CDH24, TLR8, FU31052. C(27>3BETA- 
HSD, YAP1, EMS1, GATA5, FU23420, FU10035, IL28RA, MAF, HMT-1, DERMOl, D1RC2, HSPC163, ARHU, 
LOCI 14990, MSTP043, CGN, DUSP16, ODZ2. INMT. GPR. CRBPIV, FU22558, KIAAI 145, TCEB2, LOC55829, SEMA4B, 
COLI2AI, MGC1 1034, KIAAI576, MTA3, ATPIB1, C20oifl55. SDCCAG28, MGC16028. CXADR, CTSB, KIAA0146, 
MGC33602, CLDN12, RAB23, DKFZp434P2»2, PR027U, BTBD6, MRPS10. SNX9, 1L4II, DKFZP43411735, LOC91523, 
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AFFX-HSAC07/X00351_M_at, RERG, FU14642. FU22833, MY05B, SDCCAG28, RABIO, LBP-32, CUorOI , DLG5, 
FU22415, PCDHB 1 6, MGC10204, C21orf63, DKFZP434K0427. NRP2, KIAAI870. TEAD2, SPTB, FU33SI6, SURF4, 
NPD007, PCDH20, MGC19825, MGC26818. MGC4604, KIAA1337, ESDN. FU23091. MacGAP. CGI-85, C8orfl3, 
FU40021, MS4A7, LTB4DH. PLEKHAI, SORCS2, CRIM1, FUI 1200, HS6ST2, FUI0697, WW45, LOC132671, DCAL1, 
5 SNX9, DKPZp76l»aMa, IOSP9. LOCS716S, LOC9070I.OPCRI.AK2, PU31S64, KIAAOSW, ANOPTLI, PBX033, 

KCNKti, MRPL4I, PZD8, UOCQL), C0PZ1, RBMSI,C20orf23, Cab43, TRIM7, OAZIN, FU10210, SYTL2, FU2Q442, 

C20orH39, K1AA1394, C20orfl 10, MOC13I4, C20orf!S2. CNN3, MacGAP, CAC-1, MAPIB, FU4002i, PR1C285, RAP20, 

TMPIT, KJP1». OPKAI, DKPZ«76aAai7, XPHI, RM1LIN.1, pUUOiO. IMUOI , AVI , HDUP01 0, BMP. NTi», CORTOPa. 

PU32194, FU90440, LOC147T00. MOC21B74, KRTI9. PCDHA10, DTNA, ROC32, ULBP2, H2AFJ, CFL1, MOC2601, 

1 0 DKFZP566F084, SLC26A9, KJAA1404, PX19, APOAIBP. WASL, TLR7, FU20739, FU25157, FU22833, MGCI4353, 
DKFZP566J2046, SNX8, BHLHB5. TAFI0, FUI 4594, MRAS, FUI 451 1, UBXDI. AMID, ANKRD9, ACTR3, TMEM9, 
DKFZp761N0624, FU20748, ROR2, LOC91461, TLE1, SEC14L2, BAT5, SSB1, E2IG5, K1AA1357, MBC3205, FUI 1046, 
FUI 468 1, HSPC242, DKFZp547A023, CED-6, K1AA17I5, TNKS1BPI, ATPI IA, EHD4, INADL, FUI 101 1, KJF3B, 
DKFZP434K0427, FU32069, CSEN, DKFZp761D06l4, MRPL4I, PXMP4, LOC845I8, LOCI 15265, LOC51255, ATP6V0B, 

15 N4WBP5, GGTL3, MAGI-3, MLLT4, LUC7L, ER01L, MGC13114, MGC39807, CAPNS1 , TRJM47, GPR34, KIAAI200, 
N33, PSCD3, NSEl, BAL, C20oH24, MGC22805, KJAA1337, CDH1 1, LOC51248, KIAA1 126, FU901 19, PVRL2. ARHC, 
SSBP4, DNAJC1 , E2IG5, FU10702, NUMBL. SET7, BRI3, FU32069, FU20O97. KIAA1 870, Ct4orf31, TP53INP1 . NCAGI , 
GSH-2, FU21963, K1AA0599, MPP5, SCDGF-B, AXIN2, CGI-149, CGI-97, MGC19825. DNAJA4, SMOC2, MRPL27, 
KJAAI542, ARHGEF5, CAMK2D, SLC21 Al 1, FU37318, C20orf64, D1S155E, UNC84B, MGC26963. <U55C23.6, GK00I, 

20 CPNE4, MGC1649I, FHOD2, HTPAP, K1AA2002, PRDM6, FGFR1, DKFZP564B1162, HLA-C, PRDX5, FU20623, 

FU20719, CI4orf47, MYBBPI A, RDH13, DPP3, PCDHB 18, NOL6. JAMl, LOC545I6, FU10210, NRXN3, MRPL53, 
K1AAI643. MGC15523, LOCI 15704, BR13, GTAR, KIAA1434, MGC335I0, FRABIN, UBQLN1, MGC3195, FBX032 
SMPl , FU10902, Clorfl 3, CGI-72, MGC45474, TRIM 8, HM13, NFKBIE, FU22004, AD-003, MMP24, RBM8A. DNAJC5, 
C20orfl69, NOR1, METL, MGC2747, FU 1 4251, DKFZp451G 182, KIAA1363, FU23393, RNF19, STK35, AMID, MGC4604, 

25 FLU, DKFZP566J2046. SNAP29, DKFZp547A023, DKFZp434F2322, SLCI 7A5, FU141 17, MGC4342. SLC31 Al , MGC2555 
KLF2. NKD2, SEC61A1, LOC91012. MSTP028, FU2042I, MGC40555, KIAA1554, AD-003, SURF4, GALKl. FACL6, 
DKFZP434D146. GPT2, BRPF3, KIAA1 165, SLC30A1, FU20542, KJAAI255, JUB. SYNP02, SURF4, MGC2550, 
LOC90507, SYNP02, ARFGAPI, KIAA0599, DNAJB1 1, UBE2H, C20orfl49, PHP14, FU23577, FU23654, LOC51290 
DJ667H12.2, FU23277, LOCU5098, DKFZp5470146, LACTB, FU90575, NEK6,Cab45. MGC13045, SRA1, DPP9, SFRP2, 

30 LOCI 13179, KIAA1784, C20oifl49, CGU)9,GBP2, PDK4, HRMT1 LI, MGC33993, MESDC2, IDS. RDGBB, RPL17 
TEAD2, SEI1, C20orf38, HSPC210, KJAA1 163, KJAA1223, RAB18, NFKBIA, SEPP1, B7-H3, MGC33607, CAB56184, 
SDCBP2, PCDHI8, SPEC I, RAB18, SH120, MGCI 1 102, MGC19825, LMLN, REN. CALM 2, PPP1RI4A, NDUFB9, . 
KIAA1026. MGC20486, FU30803, AKIP, LTB4DH, DKFZp547A023, C20orfl67, FU31937, FU20186, APXL2.CFL2 CGI- 
20, KIAA1437, PVRL2, KJAA1295, KIAA19I2, DC-TM4F2, CDW92, RPS27L, CAMK2D, RAB18, FU21415, MGCI0999, 

35 KIAA1896,KIAA1337,CGI-69,andSTCl. 



Table 7B 

40 Genes Down Regulated in Un-passaged Tumorigenic vs. HSC 

HSPC053, HOXA9, SPINK2, HOXA9, MPL, KIAA0125, BEX1, FU14054. CD69, ANGPT1, AKR1C3, LAGY TNFSF4 
HLA-DQB1, ITM2A, KIT, GUCY1B3, PLAG1, PROML1, MYCN, MLC1, LYLI, MPO, HOXA10, PCDH9, , PLCL2 HLF 
SV2, LOC81691, DLKI, HLF, ERG, SOCS2, MYB, PPM1F, PRSS2, BAALC, NPR3, EREG, MMRN, 1QGAP2, C17 
MPHOSPH9. LOC5 1 659, SELL, MEF2C, TEK, RAB38, FUI 01 78, TRY6, N1NJ2. FU22746, BM046, ICAM2, MLLT3 

45 BCL1 1 A, HMMR, NAP1L3. MPO, A REG, SATB1, LGN, FU10713, ERG, PADI5. IGHM, HLA-DQA1, SCHIP1, ARHGEF6 
GUCYI A3, TMSNB, TYMS, TALI, MS4A3, GMFG, FLU , LPIN1, 6-Sep, C20orf42. TACC3, LOC81 558, MCM5, TRAITS * 
IL8, CXCR4, KIAA0186, RetSDR2, RAMP, MGC2306, LGN. CDW52, HMGA2, PTGER4, NUDTI 1, ZNF198 PCDH9 
FU10468. PSIP2, CRHBP, ICAM3, IL12RB2, KJF4A, DKFZp761P!OIO, FU12428, GPR56. CXCL2, PRIM1 BIRC5 
PLAC8, TFPI, H3F3B, HBB, NEFH, LM02, SV2B, 1TM2A, BRRN1, MCM2, MLLT3, H2BFQ, DOCK2 UBCE7IP4 ' 

50 ZNFNIA1,BCLMA,DDO,NRIP1,TARBPI,HBB,KIAA1750,F2RL1.NRIP1,FUI0719,CDC25A,VRKI,DUT PIP5KIB 
NR4A2, BCLI I A, BM039, HSPC022, 6-Sep, TOP2A, PDE4B, GIT2, JAM2, KIAA1939, MAP4KI, RUNX3, SELP, ANKT 
B4GALT6, BCE-1 , HBD. PECAMI , E2F3, FLT3. PIR51. TRAP-I. TFR2, P31 1. HSU79274. CLDNIO, DNMT3B, CDC45L, 
CDW52, PELI2. MGC86I, Clori29, BRCA1, HHEX. LBR. TOX, ITGA2B, FUI 1712, LOC8I691, PPM1F, STAC, CRYGD, 
MAD2L1, K1AA0379, ITGA4, PLAGL1, TALI, PF4, ELMOl, ITPR1, RNU2, SNTB1, RAD54L, HCGIV.9, LRMP, BRDGI 

55 ZNF22,CABCI.TEC,NR4A1,FU20898,FU21276,FUI0038,ITGA2B,ADA,SSBP2,RRM2,STMNI PSIP2 DSIPI ' 
NR3C1. RAD51, SCML2, STK17B. LCP2, MCM7, NT5M, FANCG, NR4A2, SCGF, KIAA0916. PRKCB1 , STKI8. PRSS21 
SEMA4D, KJAA0101, DLG7, FU10493, KOC1, PDZ-GEFI, ASB9, SCN9A, KIAA0820, FU23468, PTGS2, H1S1, GABPB2 
KLHL3, PRKCB1, H1FX, PDZ-GEFI, TKT, AKAP7, MST4. PERI. CKAP2. GSTMj, K1AA0582, PRKCH, AMD1, AD024 ' 
CD34, SLC27A2, FOXM 1 , RAGD, MEF2C, LOC5 1 334, EDG6, HMGB2, FU22690, CPA3, ANP32B. GNA 1 5, PRCl 

60 CXCL3, SAH, CENPF, PRKACB, KIAA0092, RFC5, MAP4KI, SPN, SORL1. RPS21, ALDH1AI, VRP, TFEC, KIAA0769 
SERPINB I , CTSW, KNSL1 , CBFA2T3, RNF2, KIAA07 1 1 , MSH5. CCNB2, PTPN7. FU22794, NASP. WBSCR5, RUNX3 
CDC42, NR4A2, MCM6, FU10719, HLA-DQBt, CI lorfB, BIRC5, NSBP1. PECAMI, WSX I. CCND2, E2F1 UPF3B 
LOCI 29080, STAT5A, KJAA0471, SCARF 1, KJAA0239, CASP2. PPBP, SFRS5, MCM5, SERPINB 1, HSPC157 
DKFZp564B0769, PFAS, C4S-2, BANK, H2BFA, HNRPA1, MPHOSPH9, SMCY, NUDTI, KIAA0841, MFNG HEC VWF 

65 TUCAN,RAB33A,FUI3949,HMMR.SRISNF2UGNAI1,H4FG,RTP80I,DACH,KIAA0918,SYK,CKS2 SLA ' 

HNRPDL. EHD3. SPN, TNFAIP3, MDMI, DJ434014.3, NASP. PMSCL1 , PLAGLI , RPIA, FU13912, FU20005 HERC1 
CDC2, DC1 1, ACYPI , TALDOl , MYB, TIF1 , DKFZP564D0462, ILIB, ING3, AMT. FU20047, GGH, PLAGLI PRKG2 ' 
DHFR, AND-1, ATP6V0A2, CDH7, RACGAP1, ITGB3BP, RPS14, TK1, POLA, FU20456, 6-Scp, SMC4L1 RYBP 
CHAFI A, HCAP-G, EZH2, POLE2, USF2. PR02198, BCL2, NUP98, ATP2A3, FU10604, AMD), SMARCFl, IL3RA 

70 RUNX1, FUI 2673, KIAA0084, KIAAl 157. HMGA1, COX1 1 , HDGFRP3. SS-56, POLQ. GRBIO, MSH5. DDX28 RRM 1 
CEB1. AS3, DNMT1, TCF8, C4ST, LSM5, TRIM22, KE04, NR2C1, KIAA0092, KIAA0332, KIAA0308, PSIP1, RNF8 
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NR3C1, TAF5, TTK, RBM8A, MGCI2760. KIAA0056, DHFR, ZFP36L2, RASGRP2, HE110, NAB1 , KJAA01 70, NAP1L2, 
K1AA0286, ABCF2, HYA22, PRJCACB, LAI Rl, 24432, DOC, TFDP2, MGC2217, HOXA10, KIAA1028. DKCI, CI lorf2, 
CI lorf21, SKP2, USP1, FUS2, DNAJC9, KJAA1 1 10, GAB2, ZNEU1, M6A, DLEU1. MAC30, DUT, HNRPD, S1AH1, 
FUI4280, KIAA0I79, TRIP-Br2, DKFZp564B0769, TIEG, PTTGl, FANCA, ESPLI, INGl, BIN2, KJAA0721, HYAL3, 
5 CENPA, LRBA, MUTYH, CAPRJ, PSMD! 1, FUU222. PDE4D, AKR1C2, BZW2, SLC27A2, ALDHSA1, BIN!, SLK, 
NFATCI , TFAM, MAPRE2, ABCC4, CA1, RBMI5, PRSS3, PR VI, FEN I, PCN A, LOC58504, OIP5, SMC2L1, ITSN2, 
TOP3A, FU23053, TIMM8A, APOBEC3G, TR1M9, RPAl, KNSL7, C5orf6, RBM12, MAC30, UBCE7IP5, CUGBP2, 
ARHGDIG, NRGN, SHCBPI , CG1-30, CDT1, DGKZ, RAC2, FU20272, C20orf42, SLA, MPP1, KIAA0682, 
DKPZP547E2 1 10, ARHH, KIAA1 172, KIAA0265, SOS2, HNRPAO, GIPC2, WASFI, MGC14258, HPRT1, KIAA0443, 
CD 164, K1AA1466, FU23151, FU 10450, DKFZP586A01 1, BUB IB, C20orf59, TFP1, KIAA0841, DATF1, SLC18A2, 
MGC 1 4258, CBFB, UBE1 L, SNRK, MGC26766, RAD52, SNCA, CHESI , KHK, LRBA, CGOl 8, MBNL, V AV 1 , BIN1 , HIC2, 
FU23018, HSU53209, ELA2, PTGER2, KIAA0555, CYFIP2, MBNL, CLC, AMPD2, CENTB1, PEPP2, ZFP36L2, CENPF, 
LEPR, C5, FU 12888, 1GLL1, TLK1, AKR1C1, 1APP, TIMELESS, DNAJC6, PR01331, TIF1, SF3B3, RES4-25, FU20641, 
TPST2, CENTB1, DUT, CD244, EP400, ZWINT, SNCA. GJA4, AVP, MRPL16, MAN2A2, HADHSC, 6-Sep, MAPK14, 
TAFIC, LY75, MELK, GMNN, NSMAF, BUB1, HGF, PRTN3, AK2, FU10335, SFRS5, ZNF215, FU12735, MGC5528, 
GABPB1, GP1BB, MYOZ3, RAB6K1FL, RFC3, OXT, SMC1L1, Nup43, PDGFC, RRP4, HTR1F, HPS4, ICAM4, STRfN, 
384D8-2, ANKRD6, ING4, JJAZl, KIAA0916, FXYD6, KIAA0981, HSPC056, FU1 1294, SPAG5, HSPC047, WFDC1, 
ORC6L, ZAP, GAPCENA, LMNB2, MGC2603, POLQ. SFRS7, MYOM2, FU10156, WEE1, DPH2L1 , MIRO-1 , POLG2, 
CHEK1 , SRPR, ST7, NEK9, ITM2C, JIK, PAICS, KPNB1, CGI-32, FU20105, PTEN, CDC7LI, FIJI 3262, ATPAF2, FGFR4, 
STAG2, UBE1 L, FU14007, KIAA0308, H2AFY, KIAA0451. FU21478, NFE2, GTL3, KATNB1, RIN3, ICAM2, CREBI, 
ABCB1, MGC4701, ATFI, LOC90355, FLJ10290, FU23392, FNBP1, SMARCE1, CES1, K1AA0419, FU20035, LOC51320, 
PRDM2, T1MM9, RAD51, PPM IB, HELLS, CHD4.MORF, TRIP13, NTSR1, LPINl, MAPRE2, ZNF278, HYA22, CG005, 
NPAT, MONDOA, LAPTM4B, RRM2, C20orfl, FU200IO, PRKRIR, SFRS3, DKFZp547I0I4. MCM3, PCNT2, NAP1L1, 
FU23476, MYBPC2, PA26, C6orf32, MGC13024, OPAI , RBBP4, BIN I, CAM LG, cig5, PLA2G3, KIAA0592, FU20094, 
HNRPH3, GEMIN4, FU13386, TKT, DKFZP434B 1 68, PMS1, FMR2, C21orf66, C19orf2, TFPI, DKFZP564O0523, LRMP, 
PPP2R2B, ZNF135, ZNF198, FBL, SCGF, CEL, LRPPRC, FU12903, FU10858, KJAA1041, KIAA080O, PCDHA10, JRKL, 
SUPT3H, ITPR1, POT1, C16orf3, CGI-48, FU22002, SFRS1 1, SYPL, MSH6, ZNF85, DLEU2, L1PT1, RFC4, FU 10539, 
LZTFLI , BMI 1 , CSF1 , COX1 1 , UBE2C, LOC93349, ATP2A3, GPC5, F2R, RPL28, TGT, TCERG1 , DDX34, LAMP2, CCNF, 
M96, CDC25C, LANPL, ADCYAP1R1, SUV39H1, FU14213, DKFZP434L0718, FU21269, PRAX-1, ANP32A, SRRM1, 
CDC6, FANCE, H2AV, C6orf48, TSN, FBXW3, CEP1 , ZNF161, SF3B3, CDC23, SFRS1 1 , CYLN2, IMPDH2, PIGL, H2AFJ, 
KL, TNFAIP3, MGC2306, Jade-1, CDKN3, FU10287, CSNK2A2, OPAI , TRAF5, RPP40, HTATIP2, ANP32A, WTAP, 
ESRRB, LOC51 185, MRE1 1A, H4FJ. KJAA0097, WAS, HMGB3, MCM10, NBR2, RPL3L, LAPTM4B, FU23277, 
HSA250839, C19orf7, MGC19570, C6orf32, APEX1, KIAA1387, FHL3, CG1-49, TMPO, CGI-127, TBC1D5, RBMX, SF3A3, 
FU10379, HADHSC, IGHG3. LOC254531, SFPQ, FUI0154, DICFZP434H132, KPNB1, WHSC1, PRSS3, CCNB1, CYP3A7, 
FU20244, RAB6IP1, SNRPA, LOCI 15648, BLM, FU20136, SYT1 1, CAT, USP15, PRPS2, UBE2D2, CENTB2, SRP72, 
TOPBP1, S1L, MAP2K5, SPG4, RENT2, SCAP1. GP1 BA, DNAJC9, TPO, ZNF261. TOP2B, PDCD1, IPW, SNX26, PTTG3, 
EN02, CNR1, DDX1 1, CRLF3, KJAA0092, KIAA0433, NBS1 , C20ori67, GP5, KIAA0I0I , BTBD3, GPRK.6, TLK2, 
FU20856, PKDl-like, RECQL5, ARHGEF9, FU1 1210. DKFZP564I052, PLCG2, BITE, HYPH, HNRPA1, ATPIIB, LIG1, 
KJAA1473, PTER, PPPIR16B, FU10597. KCNDl, FU22474, MTMR4, SMC5, FU20288, MED6, ULK1, DNM2, ZFHX1B. 
LRP16, FU1 1 184, RNF38, LOH1 1CR2A, NEDD4, AND-I, ITGA9, CDK2, PGDS, FU1 1896. FU13449. LOC93081, 
MRPS14, ANP32B, FU21272, KIAA0555, CDCA4, K1AA1966, FADSI, PRKCN, OGT, TR]P-Br2, KCNE1L, UQCRB. H1F1 , 
SCA7, RAD51C. HDGFRP3, FU10565. HINTI, AKR1C1, PTBP2, TCF12, CG005, MPHOSPH9, KJAA0953, OSRF, 
C14orl94, PNN, NGLYl, L1LRA2, CD79B. LANCL1. C20orfl6. CCNE2, MTCP1, PPAT, KIAA0800. KIAA1039, MGC5149, 
FU22843, FU12610. MRPS31, Cl4orf2, RUFY2. NCOA6IP, FBX04, PRKAR2B, TOX, HBOA, PMPCB, LOC51275, GFI1 , 
MGC21654, TGIF2, LARS, DKFZp547P234, NR4A1 , KJAA0036, PHKA2, MYST1, HSA9761 , AIP1 , TFAM, CDC20, 
CLNS1 A, THY28, ZNF145, FU20509, FU10890, MAX, FU203I2, ZNF305, C21orf45, ESPLI, ZNF292, VIP, FU13902, 
HA- 1, ARTS-1, AS3, H4FI, THEA, FRAG1, DNA2L, KIAA0240, OIP2, ZNF16, GOLGIN-67, GPR44, MTHFDI, IMPA1, 
GNB2L1, CNGB1, SYPL, PASK, PTDSS1, FU1 1342, MRPS31.CBX8, TTF2, DYRK1 A, CR2, RANBP2, FU20003, 
APOBEC3B, BCMSUNL, K1AA0725, PDE4D. PRH1, XP01.CML2, HYA22, IDN3, KIAA0261, ZNF175, YARS, CDC6. 
MOAP1, GLRX, ATP2B2, PPAT, FU20530, ZFR, COIL, KJAA1 100, PERI. PSTPIP2, TXNDC, PP2447, FU13197, CIAS1, 
JMJ, SYT1 1, H2AV, SPS, CUL3. FU23306, SNRPD1, FU 10876, NBR2, DKFZP434F03I8, SPI00, NIP30, BANP, SMC2L1, 
GPR2I, CSTF2T, HSA9761 , SFPQ, EFNA2, GRBIO, RPS20. KCNABI . FU32069, PUM2, RPL17, FU20499, HGF, CCND3, 
CTSG, ABCC1, PIAS1, PPARBP, DC13, SPHAR, SUSP1, C14orfl0, NPFF, PFKFB1. PAPOLB, H2AFY, SPRR2C, STAG3, 
CI lorfB, D6S2654E. INVS, ANAPC1, GPHN, DKFZP564O043, TM7SF3, UBE2E1, NAP1L4, RASA I. MGC 1 2909, DIAPH2. 
FAIM, UCHL1 , C10orf2, NUM A I , FU 1 0706, SSH3BP1. FU23560, ZNF137, MTMR2, ZFD25, PIGN, KIAA0252. MEIS1. 
SSRP1, ZNF363, NUP50, FU10315, UNG,COL6AI, ZNF10, 1LF3, DDX28, MGC4170, TSC22, MATR3, ARHGAP1 1 A, 
LAG3, LOC51231, C21orf33, K1AA0376, ZNF42, RERE, GalNac-TIO, NSBPI.CLEC2, RNPS1, MAP4KI, ADSL, SYNGR1, 
RPL22, FU10716, LHX6, FU10546, XRCC5, SP192. JJAZl. INPP5D, HPIP, LOC57019, DKFZp434N062, DEK, EIF4ENIF1 , 
ZFP36L2, FU13920, MDSl, KJAA0404. HMGBI , ILF3, SYNGRl, SIAH1, FADS2, K1AA1074, FUI2788, TAF7, KCNA3, 
CL640, KHDRBS1. FU12377, EDI, MTCP1, FNBP1. EPS15, BHC80, CHD1L, DKPZP434LI87, FU20477, SCOP, 
KIAA0470. ME3, QKI, SALL2, SON. CSF3R, HDGFRP3. EIF2CI , P53AIPI. PCTK2. PAI-RBP1 , ATRX, HTR2C, CHAF1B, 
NXT2, Nbak2, CDC14B. CCBL1, GTF3C3, DNMT2, SLC24A1, AND-1. FU13373, SET, USP4, CRSP2, NFRKB, P2RX1, 
SE70-2, CALCRL, DKFZP434D1 335, OSBPL3, TUBA1 , DKFZp434N062, DNAJC8, ALOX1 2, RTN3, KIAA0543, DNAJC8, 
AFFX-r2-Bs-phe-M_at, AXOT, PSMAL/GCP III, WHSC2, DMRT1 , TIC, AF311304, NPR3, C14orf93, FU10483, IMPACT, 
TGIF2, TNS, CAPN3. ZNF292, FU22557. KJAA0036, CGI-79, H4FA, TFDP2, UBL3, SLC22A6, CGBP, SNRPD1, SCGF, 
MRPS27, ZNF335, RBBP9, STK12, MAT2A, FU1 1 175. KJAA0528, MXD3, CPSF4. HINT1. PPIH. GNAOl, BRD1, 
KI AA0368. API S2. NAP1 LI , ST3GALVI, ZNF287, CYP2C8, ZNF291 , KIAA0582, GART, EPM2A, , , LOC5 1 1 94, FU2 1 269, 
EMCN. MGC4I924, USP2. HEMGN. MGC24665, ZNFNI A1.CDCA7, SHANK3, Evil, CDH26, FU20171,C4ST3, 
MGC21854, ST6GalII, CT2, WHIP, MGC16386, FU33957, BCL1 1A, FU33069, DKFZp762L03 1 1, ZNF6, DACH, CENPH, 
EHZF, NIN283, FU39957, DKFZP566N034, PTGS1, DKFZP586D0824, KIAA1218, MMP28. NID67, CYYR1. 5*OYl 1.1, 
BIC, CDT1, FU14503, B3GNT5, SDPR, 1TGA4, MGC16179, HOXA7, ROB04, GNAI1, DJ79P1 1.1, C1QTNF4. RAD52B, 
KIAAI726, FU30046, ARHGAP9, PRDMI6, FANCD2, C21orf91, UHRF1, OAZIN, FKSGI4, NIN283, EPB41L5, RAB39B, 
TFDP2, FU12994, PRJCACB, FU32009. KLHL6, FU10493, KJAA0748, FU21986, NOG, GPR27, EPCi, STIP-l. CGI-105. 
MGC12935, FU2Q09J, HSAJ1454, EVIN2, WAAI554, MGC20262, FLJ20354, MGC8721, EKJ1, MAMU, SEPP1, TRB®, 
CHD2, MSI2, DKFZP434 AO 131, KIAA 1 554, MGC20362, KIAA179I, TMPO, SYTL4, SH2F, KIAA1337, HNRPD, Rgr, 
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FU00026, IRF5. MGC4832, MGC34827, PRAM-1, GAB3, ING3, MGC7036. Ells I, DKFZP761M151 1, PR0163S, ZNF367, 
MYNN. SH2D3C. FU1 1220, HHOP. MCMIO, ONG2, FU20280, PUI 1252, RPL13. YR-29, KIAA1805, FUI4642. 
FU12892,CGI-67,0SM, EIF3S6, DKFZp761D221, PAPOLA, MCLC, LOC159090, FU20280, KLF12, LOCI44455, ALS2, 
WHSC1, STRIN, UCCI, FANCA, PTPN22, K1AA1677, FU23563. MDS006, HMGBl, MCC 10744, TIGAl, IL17D, SNURF, 
5 LOC221002, CED-6, I -Sep, CGI-105. LOC134147, FU39370, DRLM, LOC85028, P66, CASP2, SLC25A21, MGCI0966, 
PUJ32J4, DCLRBIB, C8TPJ, ATPAPI, PUOO026, CGorfM, NY-REN-SI, MOC23274, DKFZbS?) K0IJ7, BRD7. 
MOC2708S, KIAAI084, DKPZp434<30920, MOC45962, MLL, CYYRI, KIAAI387, PU23306, AF15Q14, RAMP. CCNBI , 

HSPC063. FUI 1220,C6orD3, NHP2LI , DKFZp76lNl 1 14, CGOBPI, USP16, KIAAI789, DKFZp434CI7l4, FU32194, 
TIGD3, FU32549, MQC20496. LCX. ARHGAP9, STN2, MCM10, GPR1 14. PPIL3. MJD, UBE3B, WHSC1. LOC51234, 

1 0 CLLD8, CI 5orfl 5. TTC7L1 , PRO2000, HEMGN, ELAVL4, KIAA1635, CLYBL, NLK, CLLD8, MDM4, MS12. ASE-1 , LSR7, 
LOC146853, TIGD7, HELLS, LOC159090, TAF9L, DKFZp762O076, FU32370, WDR9, HRB2, TIGD2, GAJ, LOC51 193, 
FU13614, BAALC, KCNKt 7, DKFZp3i3A2432, ARRB1, DKFZp762N0610, DKFZp564B0769, MGC45866, CGl-30. 
FU23277, ROCK1, TRA@, ARRB1, CUL5, DKFZP727C091, FU34817. FKBP5. FU0O058. FU900I3, FUI 1275, 
KIAAI21 1, FU13215. HSA9761, EVIN2, DKFZP434C245, MGCI6824. HSPC126, HSP70-4, LOC119392, FU35382, 

1 5 MMP28, ARIH2, SUV39H2. DKFZp761F01 18, FU10997, NDUFB1, MNAB, MU, FRSB, KIAA 1 87 1 , RARA, FU11712, 
MGC5306, FU30525, FU00005. LOC115330. AMBP, FU32942, LOC91768, PEC1, KIAA1959, MGC10744, FU90013, 
5'OY 11.1, LOC 1 1 6349. TSGA 14, Kl A A 1 954. HSPC 1 29. KI A A 1 1 94, KJAA 1 238, KJHDRBS 1 . SNRPE. SG1CL. FU31818, 
CNOT6L, KIAA0853, MGC39650, FU22955, CI 1ORF30. CKLFSF7, CGl-30. GRCC8, AP3M 1 . MGCI0946, CRSP6, AG S3, 
DKFZp564B0769, LOC81023, STAF65(gamma), ZRF1, LOC63929, HYPC, LOC90507, bioref, FU21438, MGC22679, HP1 - 

20 BP74, Jade-1 , RGM, CYCS, EG1 , C20orf92, TPC2, AUTS2, FU21918, ZNFN1 Al, MAIL. DC6. AUTL1 , TAGAP. STARD4. 
TBRG1, FU20354, LSR7, RARA, FU14936, FU12975, K1AA0379, RIG-1, PPP2CA. MGC15548, HNRPC, ZNF265. 
TRAP25, DKFZp564D177, MGC33864, HSPC 129, PPHLN1. HSPC 195, FLJ32020, WWP1, AKJP, TADA2L, 
DKFZP564I1 171, F1GNL1, GRP58, K1AA0141, LOCI 51648, FU20095, FU10997. IC1AA1545, TIGD7, PRKRA, FLJ20060, 
DKFZP434G156, FU14775, NAV1, RPLP1. B3GNTI.C21orf45. K1AA1586, ELD/OSA1, LOC5I249. K1AA1982. FU23309, 

25 ANAPC1, HINT1, MGC17919, TSGA 14, DRJLM. MCM6, KJAA1238, KPNA4, AFFX-r2-Bs-thr-3_s_at, 1GHG3. YARS, 

FU20309. LU, FU 1 0407, MGC 1 4797, KJAA 1 554, LOCI 1 5827, NRM, DNMT3 A, MGC4308, KIAA 1 554, MGC4 1917, ATE1 , 
TUFM, ROCK1. MATR3, KJAA13I1, FGD3, FU10876, KIAA1337. ZNFNIA4. PRO2000. SCAP2, FBX04. CNTN1, 
MYH1 1, TRNTl, TCF7L2. CDK5RAP2, DKFZp313A2432. GTF2H3, MGC14439. MGC4730, MGC19570, EIF2S3, RNF3, 
MGC13204, CHES1, CNNM3, SFRS3, SMBP, TMF1, CSTF3. HBOA, CDCAl. FU32745. SPIN. WHSC1L1, 

30 DKFZP566II024. FUI 4906, C20orf24. OSBPL7, NAALADASEL, HSA251708. KIAA0254, LOCI 44402, FU34231, 
KIAA 1228, C20orf72, RANBP2. and NIP30. 



Table 7C 

35 Genes Up Regulated in Passaged Tumorigenic vs. HSC 

FNl, FN1, RAD, KRT19. FN1. FN1, ITGB5, S100A8, S100P.CA12, TACSTD2, AGR2, S100A2. DC12. DSP, DUSP4, 
FU201 51 , IGFBP3, S100A9, CXADR. CYR61 , B1K, PTPRK, SERPINA3, ziziminl , CD24, SYN47, HRASLS3, LGALS3. 
FUI 1619, LCN2, RARRES1, GOLPH2, HRY, TFF1, EFEMP1 , STHM, 1FI27, SFN, MGC4309, ABCC3, DKFZp564Al 76, 
CD24, MY06, KRT7, MUC1, IER3, CTSL2. S100AU, MET, PROI489. C8orf4, PPL, CD24, GPRC5B, SI00A8, COBL, 

40 CDS1 . TACSTDl , TACC2, KRTI8, IL1 R2, SOX9, SPUVE, CAV2, TSSC3, C3, CYP! Bl . ITGB5, CD9. KRT6A. MAPKI 3, 
ARHGAP8, CDKN2A. S100A10, SFN. RDHL, SOX9, CEACAM6, FU20273, MGP, CAV1, F3. TGFB1, LGALS1 . MYO10. 
S100A14, INHBA, TM4SF1, CXCL1, TUBB, PPIC, FU10052. IL1RN, DPP7, FXYD3, GALNT3. KRT6A, ANXA2. ANXA2, 
FER1L3, ANXA9, TPD52L1, HRY, PTPN3. EFNA1. C8FW, CDH1. EPS8. CLDN4. PTPRF, CCND1, CALU, GALNAC4S- 
6ST, DKFZp564!1922, ASS. CAP2, FARPl, CRIP1 . LOC51760, HOXA1, MIG2, ANXA2P2. TGM2, MUC16, PAPSS2. SNK, 

45 RA1I4, CAV1, COL4A5. C4.4A. PTGIS, KJAA1078. SLPI. SARI, RARRES1. DUSP4, ANXA2. FU1090I, CD24, KRT6B, 
EPN3, ADAM9, EPHA2. TFAP2C. BMPR1 A, PARVA, SERPINB5, ENAH, MARCKS, FAT. BF. TACC2. FU20171. 
NCKAP1, TONDU. PIGPC1 , PARG1, EMS1, CTSL. L!F, EPB4 1 LI , 1SG20. ITPR3, LOC90957. CXCL5, PACE4, PHLDAI 
HNI, CXCL6, VIL2. Clorf34. GNG12, ALDH1 A3. TJP1, TM4SF6. ROR1. FU20151, LGMN, DUSP5, IRS1 , GFPTl , CD24 
ADM, GATA6, LAMC1 , NRCAM, CRABP2, ARHE, MCP. YAP1. ADFP, CARD10. COL4A2, EDG2. PTGES, OSBPL10, 

50 IGFBP3, KCNKI , RAB20, R1L, NFIB, EFEMP1 , CTSH, PDXK. SGK, DEFB1, KRTI7, RAB25, HUMPPA, C12oif5. DLG5, 
KJAA0869, SLC1A1, PPP1R14B, KDELR3. RAB3 1 , DDR1 , TSTA3, CDH3, TFPI2, PPAP2C. SLC12A8, TM4SF1. FU22662 
DDR1, S100A6, DD96, KIAA1078, VEGF, ARHGAP8. ELF3. RAB31. RIG. MAL, COL4AI, HBP 17, LOCI 13146. ERBB3 
RHCG, NR2F6, EMS1, MUC4. PLAB, STEAP, SI00A7, NET1, FUI 1856, MGC5395. GPR48. DLAT. R1N2, NFIB. 
CEACAM6, COR02A, TIMM17A, CLMN, FUI 3593, FARPl. E2IG4, IL1RL1. DSTN. CYB5R2. TIMP2. KRT8, GFPT2, 

55 POLR2J, SLC6AI4. ANXA3, LAMBI, FU21918, MGC10796, EPB41L4B, G0S2, SDC4. CCL20, TLE1. LAMC2. NMU 

SPAG4. TRJM2, RAB31 , EGFR, ZNF339, MGC35048. PLAT, PITX1 , ZFP36L1 . GMFB, PHLDA I , BNC. SLC 1 1 A2, LAMB3, 
TFPI2, FU22408. SAT, LAMP1, POR, TGFA, MY06, KCNMAI, TPM2, TUFT1 , GPR87. BZWI , KDELR3, ANKRD3 
EGFR-RS, AKR1BI0, RBPI, CDKN2A, CLDN1. AKAPI2, SLC7A5. SEMA3C. ERBB2, GPR64, PLXNB1 , C0X5B, 
MGC1 1242, FACL3. PPARD, PPAP2A. EMP2, CASK. MT1 H. TMPRSS4. PDEF. KDELR2, FU21610. TMEM8, GSTT1 

60 KREMEN2, ECT2, PFN2, MT1X, MT2A. HAIKI, CNN3. PTK2, ILIA, S100AI3, NDRG1, MIDI, TNFRSFI IB, SOCS5,* 
MATN2, ME1, SEMA3F. ARHD, PP35, ZNF144. MLPH. PDZK1. SCD. CRYAB. HSPC163. RRAD, IGSF3, PCBD, ITSN1, 
ILI3RAI, UGCG, EDG2, ANXA8, SSSCA1. LAMAS, KJAA0436, KIAA0599, ENDOG, SLC6A8, CALD1, FUI 1 183, 
MGC3101. UMPK, EFA6R, NQOI, PTK9. MT1L, ELF3, CST6. ST5. NET02, KIAA0802, MYOIB, NOTCH3 PTK6 
KIAA1416. MYOIC, SUCLG2, KRT17, RHBDL2, AMOTL2, COL7A1, IL20RA. CD14, CEBPD, SMARCAl. ESDN 

65 TNFRSF6. FU20591. PEG 10, FOXAl , KIAA 1 026. FU21870, PBEF, TOB1, AQP3, L1SCH7. TGIF. MYOIB. MPZLI * DDR1 
CP. IQGAP1 , P4HA2, BMPRI A, NEBL, PLEK2, EPHB4, AK3, BHLHB3. IL6, TAZ, PLS3. OSR2, SH3YL1 NQOI 
PPAP2A, UP, SBBI31, KDELR2, KIAA0790. FU10292, SLC2A1. AQP6. P2RY2, MTAP, FUI0718, DAF. MOB MKLNI, 
TM4SF6. SQSTMI. OCRL. C2lor(97, NMB, FU23186. SDCI. RISl. PTPRF. KLK10. SCEL. MGST3, CSTB. HOMER-3, 
PON2, CASK, SSH-3, DPP4. HSPB1, MGC2376, LOC92689, RARRESI , LTBP2, BNIP3, HMCS, TGM2, TNC. ITCH 

70 MRPS12, CTSB, SUCLG2, PPIC, SLC3 1 A 1 , MGC 1 4480, KIAA0440, EGFR, AK3. SRD5A1, FBP1, FU13984, UBE2H 
H2BFL. MGC3103. NPD009, FCGBP, CDK5, ANG, TEAD3, DPP4, PRRG1, NQOI, KIAA0429, SUCLG2, 1F2, EROIL 
CLDN3. SERPINEI,SFN, FHL2. HS3ST1. PDE8A, CLDN8, BAP29, RRAS2. RPL5. PIGU, PPFIBP2. DNAJB2, RRAS2, 
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N1D2, TOPK, MRPL19, NT5E, FNI, KJAA0103, CED-6. MAP4K4, PRSS8, COU3A1 , G1P2, ROR1, UGCG, BCAR3, ISG20. 
CYP24, LIM, LOC57228, SERPINE1, SLC7A8, TJP3, ESR1, NPAS2, CKAP4, CLDN7, UCHL3, KJAA0143. RBSK, FJXl, 
NOL3, SLC39A4, FU12910, BNIP3, PLP2, FU2253I, FU22028, JAM1, LMNA, K1AA0644, CUGBP1, VNN3, LAMC1, 
CX3CL1, THBS1, NUP50, SLC3IA2. NNMT, THBSI. AMMECR1. KMO, MAPKI3, KIAA1695, RCP, GTF2IRD1 , 
ARPC1A. MMP7, DKFZP434E2135, IF2. GLDC, PRSS1 !, TJP1, ATF3, PAX8, 1L13RA1. ATP6V1C1, TST, SHANK2, 
ANK1 . CRIP2, ChGn, GAS2L1 , EPHB3, N33, CD59, GEM, EIF5, CENTG2. OAZ3, ASPH, SRPK2, B3GNT3, EDNRA, 
HSPC159, BACE2. ATP6V1C1, DPI, EHD1, DNAJB1, YKT6, KLF8, DDEF2, SRD5AI, RALA, CYP1B!, GPNMB, 
DKFZP564A022, FGFR3, ACPI. FU20366, TLR5, SCD, K1AA0882. KIAAI028, SC4MOL, MPZL1, RALGPS1A, SARI, 
PTCH, SDRl, PDE4A, CELSR1, F12, FGF2, GCNT3, SNCAIP, DDR1, PBEF, MMP14, EGLN1, ELOVL1 , ADCY9, FST, 
KJAA0716, HSPA1 A, CNGAI, HNMT, KiAA0984, S1RPB2, HRHI, ITGA3, FASTK, LDLR, RGS20, MRPS17, ELM03, 
APIM2, TEGT, SH3GLB1, SMARCA1, UNC84A. GJB3, CAST. DKFZP564F0522, SLC19A2, HK2, ID1, ARNTL2, EVI5, 
KLK1 1, KIAA0703, NPAS2, MEIS2, CRIM1, GCLM, PARD3, EML1, RAD23B, AP1M2. S100A1 IP, YWHAZ, PON2, 
MTCH2, FU23153, TUBB-5, CDH6, SCD, KRT5, RNASEH1, LHX1, UBE2D1, TMEFF1, MGC4171, PGM 3, KLC2, TNF, 
HSKM-B, IDH3A, KIAA0874, FLJU773. PSMD5, HGD, PPP1 R13B, TNFRSF12A, FU 13841 , MBLL39, SH3BP5, FU22418, 
CETN2, CAST, IF2, LLGL2, SPATA2, SYNGR2. SLC16A1, FBX026, Clorf27, 1TGB5. LOCI 13251, KIAA1029, FU20623, 
SELENBP1, PCDH1. DAG1, TMSB10, SUDD,STK17A, LADI, SQSTM1, THBSI, ARNT2, CGI-1 15, TR1P13, DSTN, 
CTNND1, SOX13, SFTPA2, SLC2A10, CGI-141, MTIG, COL4A6, CTNNALl, RIL, 1L1RAP, SNRPD3, MAOB, G1P3, 
PIK3R3, FU2151 1, NAV2. CLDN3, VEGF, K1AA1609, MEF2A, SCARA3, CPD, FER1 L3, KMO, NY-REN-45, JAG2, 
OSBPL2, YIF1P, FU10055, PSMD12, GRIT, LOCI 13251, FBXL2, PRSS16, PTPRG, FOXE1, EMLl, GUK1, RH06, TPBG, 
HRB. H_GSI65L15.1, FU12571, MGC29643, SBBI26, MARCKS, PSMB3, SLCl 1A2, FZD2, KJAA0220, TMEPAI, MTRR, 
HMGE, BCL6, STK39. CELSR2, KIAA0895, ACPI, E21G5, KDELR3, CYP-M, ANXA10, ANK3, CLIC4, KRTHB6, TSTA3, 
MLF1 , TES, ASPH, PAPSS2, SLC20A2, RGS19IP1, NFIB, NPD009, HOXB7, FU10134, APOE, K1AA1219, KIAA0173, 
PODXL, IGFBP1, HSPCA, MAK, CI lortt, HIG2. CR1M1, FKBP2, HSPA1B, FU20624, CPD, ITCH, ENSA, UNC84A, 
K1AA0062, EPPB9, FU10851, STK6, PSCA, PTP4A1, DNAJC3, FU13782, CKTSF1B1, UAP1, KRT15, AXL, HMGCS1 , 
GNPI. PRKCI, MGC5509, MAGED2, CD63, FUi 1856, ADAM 10, KIAA0934, DXS9928E, SYNE-2, 1FNGRI, SLC7A1 1, 
RIG, PP1057, LOXL2, SPOCK, PTPRF, PACSIN3, ATP 11 A, STK24, CAPN2, C4BPA, FU1 1 149, TMP21, CYP2E1, 
COL4A1, PTP4A1, KIAA0937, PKP2, ARF4, KLF5, HSPA4, NPC1L1, ATP5J2, MSLN, TLE1, ARK5, SS18, SNARK, 
LOC56902, KIAA1630, JAG1, KIAA0843, CIS, MAP4K3, TAZ. PTHLH, RHEB2, NEDD5, HOXB7, MGC24447, EIF2AK3, 
UGTRELl, MIG2, ADK, GAL, FTH1, FTS, PEN-2, TNFRSF1 IB, CGl-148, MGC1 1061, LAMP 1 , MGC3985 1 , CPD, 
MGC1 1061, NCOA3, CDC42BPB, Cllorf24, MAP3K8, MGC3038, TRA@, 1RS3L, CLTB, SC65, KJ A A047 1 , PTS, POLR2K, 
CED-6. BLZF1, TRIM36, SPR, AP1S1, EVA1, LIMK1.TIMP1, K1AA0923. NDUFS8, EMP1, BFSP1, JAG1, GOCAPI, BID, 
RIL, CGI-90, CLTB, RIG-I, ANGPTL4, ATP1 1A, ITGAV, 1L1RAP, SH2D1A, FU22693, INSIG1, FKBP10, FU20847, 
DUSP14, VDR, IFRD1, TOMM22, POLR2K, IGFBP4. HSD1 1B2, PTHR2, PREI3, FU10769, AFAP, ENC1. MFN1, CD24, 
H2BFT, TR1M2, H1P2. JAG2, DAF, FU 1 0099, CRK, YES 1 , DLG5, RARRES2, LIPG, APXL, FU201 1 3, CYP5 1, CALM 1. 
MKI67, PLS1, VIP32, WARS, ABCA1, RASAL1, CDC42EP4, MYOID. CRA, H2BFB, KJAA0790, BOP 1 , TACSTD2, 
KPNA2, SGSH, RPP20, LAMP2, GRSF1 , CBLC, ZNF165, SCAMPI , PLOD2, GSTM3, CLTB, C2orf6. MST1 R, GSPT1 , 
CLCA2, SGCE, CHST3, CDC42EP4, NPC1, TPM4, HEBP2, WBSCR2 1 , HMGCR, ARL7, FU20623, DHFR, FU23548, IL8, 
DKFZP564F013, SECTM1, RAD23B, CFLAR, POU2F3, ITPK1, IGSF4, CBX3, RHOBTB3, PDP, HSPA4, WFDC2, TRIM 16, 
ARHD, KIAA0632, TCN1, ITGB4, KIF5B, SGPL1, RAD1, EIF2S2, CYC1, IL1R1, HARC, ICIAA0779, SLC25A13, PPARG, 
RAB 1 7, PLEC1 , DKFZP564A2416. C20orf97, DDX26, ALDH3A2, CGM2, BAG3, EPB41 LI , GS3955, FU20986, C14ort92, 
PP35, BTF, KRT7, FU20457, G10, EPS8R2, LOCI 603 1 3, MGC2376, KIAA0429, GOLGA2, GOSR2, COX 17, FU21313, 
FU10300. EIF5, SKD3, ADK, NPEPLI, SLC35A3, FU20I86, YWHAZ, UBE2A, CYB561, NR2F2, ELK1. FU13397. 
LAMP2, SGSH. FDPS. FU10534, PIK3R3, SPINT1, FUI1619. FU20989, ATIP1, SORD. PP, HCCS, SLCl Al , FU20739, 
SLC6A8, RBBP8, GRIK3, CALU, KIAA0644, SAA2, KIAA0934, USP1 8, TXNL2. FU 10521 , FBX03, SSBP1 , MGC3067, 
CGI-I00. MRPL13, PIG7, KJF3B, KIAA1735, DAAM 1 , ADAM 1 7, IL5RA, TPD52L1 , PPP2R3A, RAB9A, PAWR, HIPK3, 
PPP3CB, EPHA1, GFPTl, KJAA043I, C7orfl4, BNIP1, LMCD1, ATP6VIG1, COPB2, KIAA0265. RPL5, FU20234, OBP2B, 
MIR16, CTNND1 , ATP6V0E, DHCR24, FRK, MGC5178, IQGAPI, HFE, DKFZP434J214. ACTL7A, APBB2, LANO, PMM2, 
HMGE, ARHGEF4, NPTX1. CTSB, RPA3, NET-7, ARHGAP6. FU20637, FLRT3, FU 10407, RTP801, NR6A1, NR5A2, 
PTPN12, ZNF217, TEB4, CALD1, HSPC1 11, DPI, SNA12, STS, ANXA4, BRIX. MGC16723. MCP, FU22055,ClorH8, 
ACTN1 . TMEM4, FU2040I , SE57-1, SH3GLB1, CDYL, OAZIN, PR01855, H41, RAB22A, FU10326, PEX13, SH3BP5. 
MIF, SOATl. MRS2L, CDC6, PEPP3, FU14675, TPD52, CTBP2, SPINK1, PPP2R1B, SELT, TNFAIP1, IFRD1, SORTl, 
ATP1B1, QSCN6, PDK1. SNX16, VIL2. PMM1. C1B1, FU22195, SLC27A5. PCNP, TNFRSF10B, CDR2, FU21657, MTX1, 
SLC38AI, BC-2. PEX3, CIAOI, PLXNB2, RODl, RPL39L, TAFIB, ZF, C12orf22, DDX26. ME1. NPEPPS, DNAJB1, 
SLC39A1, AT1P1, MGC2742, BBOX1, FAM3C, FBXLl 1, EGR1, LIN7C, UBE2G1, MCP, TMPRSS3, MARCKS, LOC56902, 
GRAF. ALS2CR3, KIAA0680, FZD6, SPON1. HSPCl 1 1. CCNB1, P2RX5, B4GALT4, GOLGA2, p47, KOCl, RAB2. 
TM4SF9. MGAT4A. HS2ST1, CD44, FU20315, TCFL4, PCMT1 , BHLHB2, VRP, RBSK, FU10829, HES2, EK11, ZRF1 , 
C2orf6, TUBGCP2, PFTK1, BZW1, CYR61, NOL3, PTGES, CGMO0, BM039, SCRIB, DDX3, SVIL, SMC6, NET-6, 
KIAA1023. ATOX1 , IER5. IL1 R2, STX6, PKP3, PITX1, ETV2, MCCC2, MRPL33, MGC2494, BPGM, C22orTC, ACTR2, 
BCL10, TRAM, B7, FU12439, DKFZp564 A 1 76, PHKA1, SLC33A1, TGOLN2, HRC, LGALS8, FU22940, OBP2A, 
STOML2, IFNGR1, POLR2J2, DKFZP586B0923, SLC2A4RG, NDUFA8, K1AA0964, FU1 1269, TMPRSS2, PLEKHA1, 
UGT2B28, ARL1, PFDN2, IGU3, FU23516, KIAA1609, WSB2, KIAA1598. YESI, KIAA0284, ATP6V1D, VMP1. C22orf5, 
HSPA6. MUC1, MAPK9, PARD3, APG12L, RAB5C, PAK6, LSM I , INS1G I , NDUFS6, ALDH3B2, TNFSF10, FU20275, 
CHML, UBE2VI, IGF2R, ITGB5, SEC61G, LOC55831, OPTN, ORMDL2, GABRP, DPP3, FU20967, POP3, GPCl, 
ANXA2P3, PRDX4, CHPPR, DKFZp434G231 1 , LGALS3BP. UEV3, KRAS2, TM4SF1 1. FU101 16, CTBP2, CALU. USP3, 
P4HA1. SLC22A1L, FER, SLC1A7, PCDHAI2, ENCl, FU 14251, PPP2R3A, FU20069, DDXx. STK6, PLA2G5, ZYG, 
PPFIAt, AFFX-HUMGAPDH^33197 5_at, AK1, GNA1 1, WWP1, HRY, SMURF1, FOP, DHCR7, GCSH, HDGF, NCBP1. 
ETEA, KIAA1096, GMPS, TGFBR3, HSF2BP. ZFP103, CD44, C20orf24, PSEN2, PEX7, TNFRSF2I, ARHGEF7, CD2AP, 
ARF4. CHDIL, MGC8974. ZMPSTE24, PSMB5, ACR, GSK3B, NEDD4L, KPNA4, VIL2, CDC42EP2. UNC119, EPS8RI, 
K1AA0143, FU22709, LOC55862, YWHAE, BAZ1 A, WIT-1, IL13RA1, ITGB8, OS4, LRP3, DRILI, FASN, TXN. RASAL2, 
NCOA3, JUP, AUH, NEK2, GEMIN6, PSMDI I , RECQL, MAP7, SNX4, TPD52, KLK8, INPP5E, KIF1C, ORC5L, CDA, 
C20orO5, FU13189, B4GALT4, CDK5R1, Clorfl 6, ATP6V1D, K1F5B, CTNND2, CGGBPl.SQLE, PTP4 A 1 , CSNK2A 1 , 
LIFR, PLSCR1, SRI. CDC20, PSMB7, C20orfl8, NAT1, KLK5, KPNA1, PEL11. TRIM29, YWHAZ, KLF4, FU21916, LTF, 
DAPK2, DHCR7, RNMT, RXRA, SPAG1, DDX21. CKTSF1B1, OXTR, KIAA1096, COL16A1, CELSR2. K1AA01 II, 
TPARL, MLCB, STS, DKFZP586C1619, TPSB2, MEIS3, APBB2, HSPCI21, ASK, ABCB6, RBMS2, DKFZp762N19I0, 
CCNE1, FU22347, TEAD4, PPIB, NDUFS8, TMG4, BUB1, RRAS2, NOC4, SSH-3, TAXI BP 1, EPN2, ISGF3G, MRPL17, 
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AHNAK, TBL1X, EKIl, B4GALTI, SPHK1, PPIF, TXNDC4, DSC2, K1AA1096, SSR1, ATP9A, OSBPL1A. COX8. EIF2S1, 
SIP1 . ACPP, FU20085, SMARCA4, SSTRl , UNG2, C1GALTI , PRKCL2, CABYR, FU1 0232. SLC4A7, ARHGEF5, GLUD1, 
MED8, MAP2K1, PPMtB, NETI, PPP2R3A, RHEB2. PME-l, FU20591, FU22595. SPS, CPSF5, MGC5466, SLC35A2, 
PLOD2, DKFZP434B103, APPBP2, TFIP1 1, FUI0252, MRPS16, KCNK1 , GOLGA5, PAIP1, CHPPR, PA200, APP, 
5 FU23338, FU 1 3852, RHEB2, PK428, BAIAP2, LAMC2, C7orfl 0, LANCL2, ITGB1, HCCS, TPM 1 , FACL3, M RPS 1 5, 

EPPB9, ITGB1, FU10199, CSPG6, COPS7A. KRTHA6, SGPLl, EML4, AHCYL1, TPD52, SHC1, EPLIN, TUBBl, GAS2L1, 
MP2L1 , IDH3A, CYP4B1 , CGI-96, TM9SF2. FER1 L4, C10orf3. FU23537, LGALS8, P2RY6, ALDOA, PEX7, EBNAIBP2, 
DKFZP566C134, NPEPPS, PDE4DIP, GSG1, FU20485, MT1F2, PCTAIRE2BP, FU23510, LAMP I. KJAA0020, GMFB, 
ACTR2, HLCS, P4HB, CYCS, PSMD8, TIMM17A, MPTC, TXNL2, PNAS-4, COI-60, PMP22, TONDU, QOPSI, FU20604. 
1 0 TAT, FU 1 0803 , CLN5, NRP2, RPN 1 , K.IAA 1 7 1 8, CALM 1 , NOV, MAO A, TPS I , FU20555. Kl AA0649, TSLL2, OSBPL1 1, 
TPM2, MRPL40, TCF-3, H2BFT, SLC4A7, SURF2, L216, KIAA0471, DPMI, DNAJA2, COGS, DKFZP434G2226, DC50. 
TCEBI, ACLY, DUSP3, ROD1, NCOA3, NFATC4, GAN, UNC84A, UCHL5, FIJI 1850, RPP38, MYCBP, PDEF, 
DKFZP586N0721, KLK6, TPM, PSMC2, SLC16A1, TEAD1, VEGF, NDUFS1, BS69, MAGEA3, TLE2, HSPC051, FN1, 
BAZ1 A, FU22584, SEC23B, , , , NMES1, MAL2, P1GPC1, L0C55971, FU20171, ShrmL, LOC91523, FU22474, H19, 
1 5 RHPN2, MIG-6, NGEF, KJAAJ 165, YAP1. MGC4309, SYNE-1 , CDKN2B. ENAH, CTL2, ALS2CR9, TMEPAI, IMUP, 
DKFZP564J0863, UGCG, MGC12335, ITGB6, CYP4X1.GLIS2, FU20273, FU31842, LOC55971 , TMEPAI, SYT13, 
SPUVE, KJAA1244, HSJ001348, MGC29643, BOK, TEM8, FU30532, LBP-32, DKFZP761 L0424, FU23153, EDG3, 
IL20RA, MY05B, GJB2, MYEOV, PTK2, KJAA2028, SBBI31 , FU10052, AGR2, FGG. FAD104, LOCI20224, CLDN1 , 
LOC51760, IRX3, C20orfl00, CLDN12, MGC4734, EROIL, FU40432, MGC33630, NTN4, KIAA1522, SLC4AI 1, ESDN, 
20 DKFZp434C0328, PTGFRN, EHF, MFI2, PR01489, TCEA3, GNGl 2, TMPRSS3, TEAD2, GJB6, ALS2CR9, DDEF1 , CFL2, 
LOC116238, KIAA167I, SDCCAG43, MGC35048, TOBI, LRG, DKFZp761P0423, C20orfl29, SMOC2, FZD4, RDHL, 
WNT7B, MGCI4839, DJ667H12.2, TEAD1, RDHL, FU14957, ZIC2, HSPC163. DLG5, FUI4735, FU20048, WW45, 
FU90440, LOC92689, DAG 1 , L0C55971 . B4GALT1 , HAS3, PIGR, SNX9, AK2, PRO2605, UGCGL2, CDH24, GFRA3, 
FU13593, CP, CRBPIV, FHOD2, MGC26963. LOC129642, UACA, YAP1, FU23420, IL28RA, PSA, DKFZp434D02l5, 
25 PPP1R14C, PTGFRN, E2IG5, C14orf3l, FU10052, BCAR1, MGC22805, DKFZp434G17l, MGC1 1034, KJAA1870, 
FU22415, FU34633, GPR54, CHDH, FST, K1AA1708, UBE2H, DDEF1, WASL, FU14408, CXCL16, PARVA, 
DKPZP434H0820, CASPR3, RAB10, PDP, ANLN, FU25157, NETO2, OLD35, UBQLN1 , LOC58489, FU23867, E2IG5, 
ATP11 A, CD44, DNAH5, LOC128153, PHLDA1, IPP, DUSP16, COL12A1, MGST1, PLEKHAI, KIAA2025, LTB4DH, 
FU20739, FU22174, MGC24180, DKFZp761N0624, 1RAK2, ALS2CR9, MGC39329, AKAP2, C14orf50, MGST1 , UGCGL1 , 
30 KLK7, FU31937, DIRC2, FU10035, MGC1 1034, SOX7, PARVA, LOC139231, GPCR1, SDCCAG28, GPR92, LOCI47184, 
LOCI 13026, MGC14798, LOC147700, DKFZP434A13I5, FU10702, LTB4DH, PYPAF3, RBMS1 , SLC30AI, MTA3, ARL8, 
KIAA1688, RASAL2. PDK1, XPR1, SULF2., STEAP2, H41, METL, FBX032, TLEl, DDEF1, GPT2, MRPL30, FU141 17, 
DKFZp434E232 1 , MGC26963, SAT, ORF 1 -FL49, GRP58, MGC33662, NT5E, FU3 1 052, RNAC, CGI-85, CTL2, STC 1 , SCD, 
DKFZP434K0427, SCARA3, MGC14128, BCCIP, MGC3195, TGFBR3, PXMP4, KJAA1500, Spir-1, ARHGEF1 2, 
35 DKFZP434A0225, LOC55829, C20orf24, HSPC242 , CAMK2D, FAD104, ZD52F10, HS6ST2, HLCS, FLRT3, SDCCAG28, 
KLF15, C20orfl39, FU39155, MGC1314, C20orf24, FU1451 1. CGI-20, EDG8, MGC10765, C7orO, MGC 14801, FIJI 0697, 
ATP1B1, EHF, JUB, FU1 1200, MacGAP, H4FH, MGC1 1 102, RORC, COL12A1, PROl 853, MGC13096, SPTB, FLJ321 15, 
DKPZP566F084, SEMA4B, DKFZP434A0225, BTC, PCDHB14, CGI-09, EMS1, PCDHB16, K1AA1384, SCEL, GRP58, 
KIAA1357, CAC-1. SURF4, FIJI 1011, LMLN, ARL6IP2, OCLN, C17orf28, INPP4B, C14oH31, FU22558, FU101 16, 
40 KIAA1 363, DAB2IP, MGC35352, GK001, PDGFA, SNX8, MGC22805. LOCI 14990, ELP2, CXADR, LOC120224, - 

ST6GalNAcI, MGC35403, MGC39350, KPNB2, DSCR1L2, FU20333, PPP1R1B, EIF2C2, PX19, BPNT1, AD-003, LACTB, 
FU36445, ULBP2, GUK1 , KIAA1 321 , SPP2, CRB3, FU90586, NDUFB9, PDK4, FU30973, HSPC228, MacGAP, DEFB 1 1 8, 
DKFZp761 K2222, ASPH, MGC45474, UBQLN 1 , TRAF4, DKFZp761K2222, DJ667H12.2, AFFX- 

HUMGAPDH/M33197_5_at, C12orf22, RHOBTB3, MGC33974, KPNB2, C9orf5, FU32421 , FU25604, COQ4, FU20281 , 

45 FU 1 3391 , TEAD2, ELL2, RPS3A, FU335I6, ESPN, DKFZP434A0225, KIAA1684. TRA@, SEC61 Al, DKFZP434K0427, 
PR1C285, Kl AA 1 870, AM N , LOC151242, FU20686, FU10210, FU22415, MGC19764, CG1-97.CDW92, NAT5, IC1AA1126, 
CLMN, RAB18, M RPS 15, JAM1, TEAD2, ENAH, KIAA1228, ACTR3, PCDHA10, ATP5A1, GNPNAT1 , CL25084, 
LOC51260, CNN3, TFDP1, FU31528, KJa!a1434, FU10902, MGC14289, GGTL3, SYTL2, MGC21874, TIM50L, PHCA, 
PSCD3, K1AA1026, INADL, DNAJC5, AD037, FU1 1046, KIAA1804, KJAA1337. PPARD, K.IF1B, MIR16, ROD1, 

50 SLC2A13, CFL2, GDF1, MRPL36, SLC26A9, LOC51290, CABYR, HSPC159, SPPL2A, ABCC3, BTBD6, SMURF2, STK35, 
CGI-85, ZAK, DKFZp434B 1 23 1 , ICCNK.6, PCDHB2, Spir-1, KIAA0 146, ZNF265, COPZ1, FU20421, CI lorfl5, 
DKPZp761 D0614, KRT19, RAB23, MGC16491 , FU40432, MGC1098I , C20orf45, CTEN, MGC30022, NUCKS, MGC1 3251 , 
MRPL27, FU90586, MGC16028, FU90165; SHMT1, FU 14525, BACE2, ABLIM2, FU20719, SCGB3A1, MGC2477, 
FU20038, MGC29643, FU30829,C20orfl55, PGK1, FU37440, RBM8A, FBX022, KIAA1219, KIAA1200, K1F3B, 

55 MGC19825, AK5, C22orf20, FU10378, INADL, HSPCA, EIF5A2, RAB18, BCL2L13, MBC3205, UBE2H, FU20354, 
SLC5A7, FU30532, C14orf47, TMPIT, EHD4, FU 13089, MGC17299, IDS.CED-6, MGC27277, LOCI 37392, FXYD6, 
MGC22825, CPM, SNX9, MGC19764, TLR7, FENS-1, SDCBP2, NUDT5, MGC1 1 102, SEC24A, CG1-141, NKD2, EFG1, 
ANAPCl 1, MY05B, MGC14833, LOC85865, EPB41L4B, FU21415, KCNC4,GSBS,TEAD2, LOCI 15548, MAGl-3,C9orf5. 
CLONE24922. M RPS 1 5, RGNEF, CORTBP2, FU20354, HSPC121, NOC4, KIAA1673, MGC14595, MGC2560, MGC2408, 

60 MRPL14, APOA1BP, FU14681, MGC13102, KIAA1437, KIAA1126, MGC13034, CSEN, SH120, VIP, PRO2000, SLC31 A 1, 
AD-003, CALM 2, HT002, RAP2A, EMU. WDR5, MPP5, LOC90990, MGC2560, FU14431. ARHGEF5, HCC8, TCEB2, 
FU13I87, FU90575, FU10525, FU23393, HOXB9. LOC8466l,dJ55C23.6, HFE, MGC13040, WDR20, MRPL4, FU25604. 
DKFZP566C134, LOC55 871, CGI-09, MRPS23, MRPL47, MGC13045, ERK8, K1AA1500, HPS3, CRYPTIC, SBBI31, 
MGC14353, CGI-20, FHOD2, PPP1R14A. REPSl, MAPKAP1, V-l, FBX025, BNIP-S, MGC13114, EKN1, GPR24, RCP, 

65 FU12806, MGC2747, OBP2A. HM13, C21orf97, FU14909, C9orfl0. STYX. THOC3. RDGBB. PFKFB4, FU21924, 

KIAA1295. ZDHHC9, STXBP5, RPE, UBE2H, PCDHB18. FU20303, NPD007, N4WBP5, FU20333, FU12747, SURF4, 
C20orf45, FU12787, LOC90507, FU10839, EPB4IL4B, FU37953, BAP29, MRPL50, MGC10999, C9orf5, TBDNI00, 
STK35, FRABIN, JUB, PR02714, MLLT4, MGC40214, CPNE4, FU22233, MIZIP, MGC14859, MRPS24, HPS3, FU23841 , 
FU23577, HSPCA, MRPS10, FU14251, SSR3, MGC13186, K.1AA1453, HN1, HOOK3, ATP1B3, MRPL50, MAP4K.1, 

70 LOC90120, D1S155E, DKFZP564O0463, FU23816, CFTR, MGC40555, MGC2078 1 , FU2O085, NOPE, FU14825, MSP, 

LM07, C7orf2, MRPL32, FU10074, MAK3P, KRT6IRS, DKFZp547A023, SAMHD1, HSPC043. FU 10597, FACL6, LGR6, 
SORCS2, MGC4840. RAB35. MGC1091 1, and MLL3. 
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Table 7D 

Genes Down Regulated in Passaged Tumorigenic vs. HSC 

MEF2C, HSPC053, HOXA9, PRG1, RctSDR2, GMFG, AIFI, AIF1, H LA-DP Bi, PLC L2, ICAM2, HLA-DPA1, PTPRC, 
SPINK2, SPARC, CUGBP2, PTGER4, CECR1, CDW52.CCND2, LYZ, SELL, CD69, HOXA9, ITM2A, HLA-DQBl, ITM2B, 
LYL1, KJAA0125, LM02, ARHGEF6, KIAA0084, MPL, RGS2, LAGY, QKI, EVI2B, ZNFNIAl, DOCK2, HLA-DRB3, 
NAPIL3, HLA-DPAI, KIT, HFI, HLF, LST1 , ANGPT1 , CD53, LST1, FUl 4054, SELPLG, LST1, BM046, TUB A3, HLA- 
DQA1 , BCE-l, CDW52, FU10178, PRKACB, PRKCB1, IQGAP2, CHESl, GUCY1B3, PSCDBP, HLA-DRA, LAPTM5, 
PRG1, MEF2C, SLC2A5, LST1, FHL1, MAP4K1, TNFSF4, PLAC8, HLA-DQBl , 1GFBP7, PCDH9, MAP4K1, EVI2A, 
SATB1, MLC1 , SSBP2, FLU, CLIC2, CLECSF2, LY75, NDN, HLA-DRB1, FU21276, DLK1 , GLUL, NUDT1 !, BEX1, 
SH3BGRL, PRKCB1, MPHOSPH9, LST1, HLA-DQBl, FU22690, UQCRH, FU22746, HLA-DRB3, SLC2A3, NPIP, 
BCLI 1 A, MPO, RUNX3, ERG, SV2, HLF, MMRN, CYF1P2. HLA-DRB4, PECAM 1 , COROl A, MOX2, SEPPI, BAALC, 6- 
Scp, ITM2B, LCP2,PELI2,C17, IGHM, LRMP, PPP1R16B, HLA-DRB5. HBB, DJ971N18.2, LOC51186, SCGF, ERG, 
LAPTM5, P31 1, SAMSNI. ITGA4, DJ434014.3, IGFBP7, TFEC, HA-l, MAGED1, HSPC022, FNBP1, TCF8, ELMOl, 
CUGBP2, NGFRAP1 , PIP5K1B, DDO, MLLT3, ALCAM, NPR3, CMRF-35H, DPYD, PLAG1, BIN2, ITM2A, MYCN, 
GSPT2, LXN, ALEX1 , P1K3CD, ADAM28, PLAGL1, FLT3, WBSCR5, C6orf37, GUCY1 A3, CD74, KIAA0053, TRAITS. 
HLA-DQBl, MGC2306. ICAM3, PTGS2, H3F3B, TCF4, SNCA. FU1071 3, PROML1 , TEK, APOBEC3G, PR01635, HLA-E, 
JAM 3, UBE1 L, BCLI 1 A, GNAI1, LHFP, LST1 , CDH2, MYB, FU10462, ZFHX1B, CBFA2T3. TMSNB, H LA-DMA, PLCB1, 
SOCS2, CG018, PDE4B, MHC2TA, PADI5, USF2, CUGBP2, VIM, HLA-DRB6, TFPI, BIRC1, PTGS1, HFL2, SCDGF-B, 
LSP 1 , NRLN 1 , M PO, K1AA 1 939. PTGS 1 , MS4A3, HPIP, FU20220, HLA-DPA 1 , NCF4, MAPRE2, ZFP, BANK, TOX, 
CXCR4, IGHM, RUNX3, HCLS1, LOC81558, ARHGDIB, TRO, SCHIP1, CRHBP, KIAA1750, BCL2, FU20950, FU10097, 
DAB2, BASP1, JAM2, FU21616, HHEX. ITM2C, SPRYI, SERP1NG1, SLA. EBI2, ZNF42, DSIPI, FU10038, PECAM1, 6- 
Scp, CASPI, RBI, TACC3, 13CDNA73, 6-Sep, MAPRE2, FCER1 A, BTK, LOH1 1CR2A, LRMP, PLAGL1. MICAL. TCF4. 
CLGN, H1FX. WASPIP. LAIR1, ZNF175, INSR, FU20456, CI lorfB, K1AA0443, AKAP7, TALI, HLA-DRA. HRB2, PLEK, 
RAGD, PLAGL1, ALDH1A1, B4GALT6, GLIPR1, GAB2. KIAA1 157, PPMIF, WAS, SETBP1, MUF1, C6orf32, MYOZ3, 
TUCAN, RNU2, KLHL3. TSC, PKIA, MLLT3.NEFH, DKFZp564B0769, PPMIF, SNTB1 , PCDH9, CRYGD, MPPI, ABCB1, 
KIAA1 1 10, ALEX3, ATP2A3, KIAA0308, MAGEHl, BIMLEC, CTSW, SORL1 , FU20898, MCM5. CD244, PPPIR16B, 
MAGEDl, ASC, GIPC2, RASSF2, LOC8 1691, SCGF, PTEN, 24432, STAT5A, 6-Scp, SLC24A1, UBE1L, CD83, TAHCCP1, 
GNA15, NR3C2, K1AA0053, INPP5D, CPA3, GYPC, SYK, PRKACB, RUNX1 , RIN3, TRB@. NPIP. CABC1 , HLA-B, PGDS, 
CD34, SPN, LOC58504, MAGEL2, TBXAS1, MFNG, LOC91316, TRAP-1, RECK, TCEA2, FU20136, ARHGAP6. AMT, 
CAT, ADARB1, PTEN, LCP1, CCL3, SCN9A, RASGRP2, DKFZP586I2223, SS-56. SLA, C4S-2, PDGFC. LILRA2, RAGD, 
HNRPDL, ZNF288, ITGA2B, LOC81691, HBD. SELP, C6oif32, PDZ-GEF1, CPT1 A, KLF2, ZNF198, TACC1, HBB, Bl. 
C1AS1, HNRPA0, HLA-DQAl, KIAA0308, MYOIF, PROI331, RAB33A, TNS, NAPIL2, CERK, MGC4170. ADA, 
RNASE3, NFE2, ANKRD6, AKR1C3. CDC42, H1S1, TRIM22, B1N1, ICAM4, 1L12RB2,CSF2RB, EPB41L3, BRDG1, 
TNRC5, CIRBP, RPLP2, AMPD2, SFRS7, EDG6, BRCA1, MSN, HLA-DQBl, C5orf5, GSTM5, ITPR1, IL16. AIF1, 
NFATC1, LILRB2, FGF23, STAC, RPL22. PTEN, LRBA, PFAS. CGI- 1 16, DKFZP586A0522, MGC13024, GALC, ABCG1, 
MGC45806, ELF1, SAP18, ALDH5A1, ELA2, GATM.CHC1L, KIAA0918, LOC51334, FOSB, PR02198,TEC, SLCIA4, 
CAD, K1AA1028, VAV1, LOC57100, CI lorf21, SLC1A4, TRPV2. EPB41L2, FBN1.CD48, GIT2, CSF3R. DNAJC6, BINl. 
KIAA0582, ARL4, SH3BGRL, GLS, FXYD6, PF4, SCGF, NEK9, PKD2, MATK, BINl, NSBP1, MSH5, PRKG2, NT5M, 
PML, CD37, SF3A2, PLSCR4, CSK, HA-l , NUDT1. SIAH1, MEIS1, IGLJ3, HLX1, SV2B, DKFZP5 8612 223, KE04, ENPP2, 
CTSF, 1L1B, PSMBI0, IL1B, ZFP36L2. SFPQ, FUl 1 175, ATP2A3, STK10, FU2202I, MYOM2, PTENP1, MGC861, 
HERC1, Jade-1, BTEB1, KIAA1102, NPTX2. UCHL1, LYN, COL5A1 , ZNF215, MGC2217, SR1SNF2L, LOH1 1CR2A, 
RERE, COL5A1. RAP1B. CLDN15, VWF, HHEX, SMARCA2, SMCY, UBCE7IP4, LOCH 5207, KPNB1, ZNF22, STOM, 
C16orr5, ICAM2. KIAA1 102. CENTB1, DKFZP434C171, ITGAM, TFPI, CASPI, CLN2, TALI, AASS, SAH, FIJI 1712, 
FXYD5, KIAA0303. FBXL5, SFRS5, FNBP1, FU1 1749, MAGE-EI, SNRK, SPN, CTSS. SIAT1, SCARF 1, HSPC047, CD38, 
VAMP5, SF3B3, FU10374, FHL1, PTPRCAP, LRBA, DUSP6, PTPRC, KIAA0092, PLA2G4A. RBM5, FU21478, PLCB2, 
GOLGIN-67, RBM8A, OXCT, HEM1 . DUSP6, CRI1, RAB6IP1, IMPDH2, C21orf33, LOC93349. EMP3, NASP, MGC40204, 
PTGER2, COL5AI, SPARC, NISCH, SIGLEC5, CSTF2T, HGF, SNX10, DACH. NINJ2. MGC12760, K1AA1332, NPIP, 
KIAA0379, LYN. H2AFY, PACAP, PLCG2, PDE4D, LOC129080, FUl 1753, KIAA0447, BCL2A1, FUS2, PTPN7, WASF1, 
ZNF42. C18orfl, UROD, KIAA0303, NRGN, RNASE2, FU23056, FYN, DEFCAP, PTPN22. MAPKAPK3, ZFP36L2, AF1Q. 
NCF4, CDH7, DJ971N18.2, PA26. ANXA6. PHGDH. MCL1, LEPROTL1, HUMMHCW1 A, TNFRSF14, STK17B, CG1-49, 
MGC14258, PSIP2, CRJ1, FU35827. CCRL2, PTPRN2, CES1 , SCA1, FU21865, KIAA0798, BIA2, HLA-DQBl , UCP2, 
DPYSL2, FUU259, FU20312, KIAA0240, GTL3, C6orf48, AK2, TFR2, FU 13949, MAX, CHKL. FUl 2668, ALDH2, 
NUCB2, HPIP, RNF8, Clorf2l, AS3, ZNEU1, FUl 1323, FU23506, LOCI 15648, KCND1, STMN1, BTN3A3, MAP4K1, 
ALG12, ATP5G2, PETl 12L, TIAF1 , KIAA1043, TRPC1, THY28, SYTI 1, HSU79274, PRPF8, CLC, PCNT2, H2AFY, 
DAPKI, CCL4, RPL28, IFRG28, CCND3, C 1 4orf94, MGC3035, 6-Sep, GNB5, KIAA0916, EIF3S7, LENG4, FACL5, AP1S2, 
MCM5, DKFZp434N062, A IP 1. PROS 1 , CIRBP, REC8.SLK.C1 lorTC, dJ222E13.1, H2AV,NEK1,BNIP2.FU13197, 1TGA4, 
FU21269, KIAA0708, IMPA1, FUl 2750, SLC18A2, EMRI , KIAA0239, RPS9, ARHH, MCJ, ALTE, KCNE1 L, ABCB1 , 
RPL22, KIAA084I, LOC58486, SNX26, ADAMTS1, USP4, STXBP1, 1TGA2B, C5orf6, RBM10, FU21439, KHK, OS4, 
MAPKI4, NIP30, KIAA0471, SLCI6A7, RIN3, DDX28, HPIP. RNASE6, ADSL, ARHG. GNG7, HLA-C, RHOBTB1, 
CACNB2, DATF1, PDZ-GEFI , RPL13, TALDOl, DGKG. FU22794, PTPN6, SYTM. C5, FU22349. FGFR4, CGBP, PROL2, 
LARS, RPL3. JIK, MGC45806, MGC2488, MGC2752, TYMS, PECAM 1, NSMAF, ABCCI , LEPR, MYB, LAIR1, LOC57209, 
EP400, ALCAM, ZNF187, FUl 3386, KPNB1, LTA4H, HGF, PP1628, NRIP1, GNAOl, 1L3RA, CD79B, CENTB1, ZNF261, 
ST18, FGF9, CDK10. RAI17, STARD5, OXT. PML, KATNB 1 , ASMTL, NEDD4, ACTA2. MBNL, FU3I821, PERI, 
MOAPI. DCK, DXS1283E, SNCA, AD7C-NTP, MYBPC2, STX8, ATPAF2, ACYPl, RAD5ILI, CLIPR-59, FACL4, AASS, 
RAC2, MGC2306. SLC27A2, FD23018, RGSI.NAP1L1. ELAC2, LOC51 185.SGKL, PCDH16, TRAF5, K1AA0682, DGKZ, 
FU10539, PIGN. FU10647, NCOAI, LBR, GFU, MAN2A2, KRTAP2-4, HLA-C. FU35827, PCDHA10. HLA-A, APLP2, 
SFRS5, FU13262, WTAP, EFNA2, CI2orf8, CCND2, PTPRC, MPPEI, HMGA2, CLK2, SWAP70, PROI843, FU14280, 
FU23277, KIAA1 172, PRCP, MADD, SMARCA2, WASF2, MGC5149, CDC42. PLEK. SMARCF1. RCD-8, ATP9B, 1HPK2, 
IGHG3, DHRS4, EEF2, QARS, Ki AA084 1 , ADRA2A, RPL29, GCNTl, UBU, GRB10, IMP-2. ABCA5, HSPC157, 
TNFRSF5, H2AV, JM4, TBXA2R, SLC1 A4, RPS6KA5, IGLLI, MGC8721, PEPP2, USP7, PSMB8, ARHGDIG, HLA-A, 
RBMI0, NAP1LI, KIAA1393, AVP, KIAA10I8, RPL28, RES4-22.NAPIL1, ST13, K1AA01 86, MBNL, HEXA, K1AA0555. 
FU20I89, MNI, TSPYL, USF2, APLP2, ZNFI35, HPSl, RPS21, MAP2K5, HSD17B8, PROSC, NAPIL1, DUT, K1AA0I70, 
TPK1 , NY-REN-34, RBIG1, IL16, AKR7A2, STK10, PRP17, WWP2, PTD015, CAPRI, ARHGAP8, FU20856, APPBP2, 
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LRRN I , MDM 1 , HLA-DMB, CGI-30. COX 11, DDX28. ACK1, TM7SF3. FU23554, SDCCAG8, FU20094. MMP28, 
MUTYH. CAI, AKR7A2, WDR6, DYRX1 A, DPH2L1. RBPMS. PU20005. MAP2K3, C4ST, FU22059, FU20202. H2BFQ, 
CAMLG, CHAF1 A, ABLIMI, MAPKI I, RAPI40, DUT, ITSN2, EHHADH, DKFZP547E2 1 10, H2AFJ, MGC4659, RPL13, 
KCNA3, BC008967, CASPI , NMI, NBEA,NUMA1, DEF6, PRAX-I, TBC1D5, KIAA0332, NEW1CP, KIAA0769, CENTB2, 
5 CKIP-l, EIF4A2, OAZ, ARH, KIAA0467, Cl9orf7, KCNAB2, TTLL1, FUI0597, SF3A2, FIJI 1222, PSTPIP2, BCL1 1 A, 
SPHAR. GLIPR1, KJAA0555, MMP2, EIF4A1. STOM. ALOX12, FU1 1588, RBAF600, PROSC, CG005, VILL, FUI 2707. 
M6A, TCIRG1, HTRIF, R1CH1, F13AI. CACNA2D3, RRP4, TAF7, ZNF134, HSU53209, LZTFL1, TKT, LILRA2, 2NF302. 
FU131 14, ZNF177, PURA, DKFZp547I0l4, TXN2, TLR3, BHC80, MGC5I39, PTPNSt, ZNF145, THTPA, BTBD3, 
MDS010, KIAA0924, 2NF292, ITGB2, TJP4. GPRK6, CYLN2, ENPP4, ALB, RPS20, FOXOl A, ADH5, CTSS, FU23221 , 

10 CI lorffi, TNFSF13. TOLLIP, K1AAI449, H1NT1, GLTSCR2, KJAA1052, FU 10260, RAB3GAP, HINT1, TAPBP, CHD5, 

LOC57406, TP53TG1 , SRP46, MS4A4A, NUP62. PIM 1 , 2NF42, COG4, ADPRTLl , ZNF289, CATSPER2, TXNIP, PDE4DIP, 
HSA250839. FUT4, HSPAI L, GALT, MGC4278, APEX1. FN5, STRJN, USP1 !, SPP1, NPFF, CEP1, GAPCENA, HLA-E, 
SCAND2, CG005, VRP, BRAP, GPR56, MLH1, GPR105, OGT, C1R, BTN3AI, FU14I07, PACS 1 , MGC26766, FU22378, 
APOBEC3C, CG005, CAI I, QDPR, DUT. ALDH6A1, FU10450, BST1, NGLY1, FU12057, FECH. ZNF137, SERPINB1, 

1 5 EZH1, CASPI, MGC3265. CXorf9, TRG@, DKFZp564B0769, KIAA0616, DIS155E, MN7, CI 8orfl , NSBP1 , NXFl , FHL1 , 
TOP3A, TARBPl, K1AA0766. RRAS, SEMA4D, CEBPA, TIP120A, IL1S, HADHSC, HIRJP3, CTBP1, DVL2, RBM12, 
RAD54L, NYD-SP15, PHCl, K1AA1042, IGL@, NPR3, HRMT1L1, FU20551. MYST1, LOC51231. TCF12. KIAA0543. 
MKPX, LOC51 157, SYNGR1, AICR1A1, SCOP, LRRN I, FY, AMY1 A, PHEMX, K1AA0930, MAP3K3, FU10631, ZNF85, 
APOL3, MAPKI 2, TRG@. POLD1 , LDOCl , POLA, TPST2, WASF3, RPL1 1, MKL1, FU22242, PTPRM, AMHR2, 

20 FU20288, TERF2, DOK4, KCNABI, DISCI , FU22494, LOC91316, VIP, POLR2A, RGSI9, C12orf6. RPS9, LIG1, NASP, 
ARHGEF9, MANBA, SARM, SRPR, CDH9, MRPL16. FU20509, SNRPN, HLA-E, NTS, ZNF232, FU 12903, PHKA2, 
MSH5, PURA, ATP9B, TRIM28, FU12768, ME2, IDS, MPHOSPH9, DIA1, ADAM8, HADHSC, STX12, COX15, RPA2, 
SHANKI,GGA1. LANCLl , UBE3A, SOX1 1, LAT, BCL7A, DKFZp434KI2IO, BRAP, SMARCC2, DKFZP434H 1 32, 
NHP2L1, FU1 1294, FU12270, KIAA1649, SRP46, PSMB9, GGA1, MGC4368, TOP2B, PTK2B, FU13912, EZH1, THRA, 

25 BAX, NAG, MERTK, HADHA, SRRM2, HNRPH3, GNG7, HSPC018, FU22573, H?CAL4, MBC2, MAPK4, FU107I6, 
ITGAL, NFRJCB, MRP63, DKFZP434LI87, GABARAP, CHD4, DKFZP564D172, FGL2, LOC57019, KIAA0478, NTSR1 , 
LPIN1. USP4, KIAA039I, ASGRI, KIAA0174, TBXA2R, TRAP95. FU22649, NEK3, ZNF271, SIL1, 76P, CYLD, CD164, 
TINF2, ZNF220, DAB2, HRIHFB2206, SF3A3, TRO, FU13373, UBE4B, GC20, ADAM28, PHKB, BCAS3, MGC14258, 
RAD52,HLA-F,KIAA072I.MRC1.CHD1L,LMOD1,FU10315,CHRNA7.NAP1L1,PIB5PA,GADD45A.RPL35A, 

30 LPIN1, TFP1, FU14213, K1AA0746, KJ AA098 1 . C22orf4, PP1044, ABCF2. FU10379, RASSFI, FU23392, RPS8, DAB2, 

FU1401 1, CDC2L2, GAD1, MGC17330, FU23342, HEI10, NPDCI, KIAA07IO, BIRC1. KIAA0349, SF3B3, MST4, IRAK3, 
CD81, LOC57406, FUI2610, SFI, SLC27A2, KIAA0804, KJAA1055, GTF2F1, SEPX1, SCAMP2, PPP3CB, U5-200KD, 
HMGN2, F2, PCBP3, FU20721, ING4, HADHSC, KJAA0286, TREXl, ATPUB, RUFY2, SUPT3H, SFRS1 1, P1AS1, HBOA, 
HAS1 , HYMAI, NUP210, TGT, FU1 1 896, CIDEB, TRHDE, FU90524, TOX, K1AA0261 , GSTM2, GAS7, MBDI , 

35 KJAA1305, PPP2R2B, CDT1, FUI1 164, TMPRSS2, TYROBP, G6PT1 , PRIM 1 , GP5, DKFZP566H073, RPS14, CCNG1, 
FANCG, CM AH, SORBS 1 , KIAA0800, C1QTNF3, UBCE7IP5. FXR1, ZNF334, CNN2, RFC5, ACAA2, GNB1, FU22757, 
CDKN1C, UROD, KIAA1028, HD. CTSG, CLNS1A, P2RX1. TACC1, ADH5, RPL13A, ZNF363, PRXCH, AF020591, 
LOC51659, PERI, TFPI, TSN, BMIl, K1AA0625, MLLT2, TAFIC, DHFR, SLC23A1, HAGE, NAP1L4, EGFL3, SCA2, 
FU20489, SNAP25, USF2, CRYL1, GG2-1, EDN3, TRPC1, AP1S2, ERCC I , KJAA0582, RPL15, LOC54103, FU22557.CGI- 

40 127, CSNK2A2, ZNF278, EDG5, IPW, RASGRP2, SAEI, KIAA0725, RTN2, CTNS, FU20274. FUI0276, LTBP4, FU 10539, 
HYAL3, MTL5, MGEA6, BNIP3L, PARVB, MGC15523, KCNK7, IGHM. PASK, KJDINS220, PCM1, KIAA0092, ASB9, 
MAP3K4, CD1B, COL6A1, HCA127, ZNF262, GG2-I,CAPN3. SAP 1 8, EIF3S5, ZNF337, E1F4AI, DBT, CROT. FU10474, 
FU10483, CBX8, DKFZP586M1523, CCRJLl AP, FU14I53, KIAA0397, COL2A1 , CD164, TLE4, PRO2730, ATM, RFX5, 
KJAA0515. FU20542, HYPH, ERG-1, DBH, SCML2, GNAOl, WDR13, GCA, FU23323, FU1 1362, CGBP, MGAT1, 

45 HMGB2. NDUFA6, KIAA0515, KIFI3A, OPA1, BRD1, ATP2B4, PSME1, KIAA0931, HPS4, K1AA1966. DKFZP564J0I23, 
DBY, HUMNPIIY20, MAT2A, DFFB, FU20294, ADSL, CSTF2T, , ZNFN I A 1 , LOC5 1 194, FU21269, DJ79P1 1.1, BCAT1, 
MGC21854, DKFZP586D0824, EMCN, C21orf91, SDPR, PR01635, ITGA4, FU20171, ROB04, ZNF6, DRLM, TAGAP, 
PRDMI6, ST6GalH, GNAI1, EHZF, MGC10966, ARHGAP9, HEMGN, GNG2, LOC83690, PTGS1, MGC41924, USP2, 
FU33069, CT2, C4ST3, PRAM-1, FU32122, SLC1 1 A3, BIC, TNFSF13B, FU37080, FU35564, KIAA1913, CDH26, 

50 BCL1 1 A, FU30046, MGC7036, DKFZP566N034, RARA, Clorf21, PAG, SH2D3C. FU00026. STIP-1 , FU39957, KLHL6, 
VIK, FU34922, SHANK3, FU00026, PTPN22, HRB2, ZDHHC2, DKFZP566KI924, SYTL4, DACH, FU21986, EVIN2, 
GAB3, CYYR1, MMP28. EHZF, FU00058, LOC93589, KLFI2, CLLD8. KIAA1218, MGC16179, HS3ST3B1, ARHGAP9, 
LOC144402, LOCI 14928, FU39370, PRKACB, MGC13I05, Ellsl,CGM45, EPB41L5, RAB39B, LOC145553, HRB2, 
SDCCAG33, ARRB1 , EEF 1 A 1 , MGC 1 2992, BBX, DAP10, CMG2, GPR27, GBP5, FU20202. UCC1, RAD52B, KJAA1554, 

55 AKNA, TBXAS1 . al/3GTP, JAK3, B2M, MGC20496, CLLD8, A LEX 3, FU2I438. MJD, FU22570, API S2, TFDP2, P5CR2, 
Clorf2l, KJAAI554, Evil, MGC8721, FACL5, CYSLTR1, CTSS, Rgr t NlD67, FU32194, MGC45400, KIAA 1 789, DCPIB 
MGC4251 , CPXM, SMBP. PARVG, ESRRBL1 , C6orf33, MGC20262, C6orD3, MGC27027, LOC51234, ZNF33A, RGS1 8 
KIAA1607, TIGAI, HOXA7, NAALADASEL, ATP8B2, CLYBL, DKFZP727G051, KIAA1214, WHIP, IRF5, UBL5, 
KIAA1946, GLTSCR2, CMG2, OSM, KIAA0748, FU1 1 1 13, FUI2994, EROI-L(BETA), NUCB2, KIAAI337, DEF6, POLH, 

60 FUI 1712. LOC91526, TTYH2. ACRBP, MAML3, FU00OI2, C6orf37, MYH1 1, C9orf24, HNRPD. CCNDBP1. 

DKFZP434L0I 17, GPR1 14, ANKH, MGC13170, NOG, CXorflO, C1QTNF4, NAVI, RPIB9, DKFZp571 K0837, SFXN1 
KJAA1497, PHACS, PAPOLA, ELAC1. MDS006. FU14167, LOC136895, CGGBPI, MGC45962. CGl-85, AUTS2, FXYD5 
FLJ32009, FGD3, HSAJ1454, GRP58, K1AA1954, ELD/OSAI. PRexl, MGCl 1324, FU90013, NIN283, HCA127, 
DKFZP564D1378, HMGB1, TRB@, MGC4796, ASE-1. YR-29, FU25476, CGI-67, STK33, SLC25A21, ZNFNIA1, DRLM 

65 PP2135, STMN3. CAMK2G. MGCI6169, DC6. GCNT1, PRO 1635, STRIN, DLCl, DKFZp761 D221. FU10656, ZNFN1A4,' 
SENP7, MGC34827, MGC15619. FU32942, RPL28, FU00005, FU23462. DKFZp762L03l 1, FU30726, MGC3200, ARRB 1 
EIF3S7, HSA976 l.FU 11896, MGC 1 0744, KIAA 1309, WDR9, KIAA 1587, MIR, FUI 2953, MGCl 2921, LOCI 30617 NAVI 
HPSE. FU20085, KJAA1982, KCNKI7, K1AA1495, LOC64744. AUTL1, LOC9I689, SEPP1, PPP2CA, KHDRBS1, DREVI ' 
MGC35274, SNRPE. LOC91689, KIAA0853, FU13215, TACC1, MGC20262, MGC17515, MGC40157, DKFZP572C163, 

70 PRPF8, HINT1, FUSIP1, MEF2D, C20orf24, TADA2L, NIN283, FS, HSPC063. ALS2, NHP2L1, LGALS12, MGCI0986 

KIAA1871, DKFZP434A013I, KIAA1949. DTNBP1, GPHN, SUV39H2, BRD7, FU32001, HYPC, EEF2K, ESRRB, ZNF226 
ILI8BP, CSRP2BP. HEMGN. FOXPl, SGKL, FU11220, TRIM4, FU21918, KIAA1545, MGC2474. CDCA7, HSPC002, 
LOCI 15294, LOCI 19710, GTF3A, TAGAP, TCF7L2. FU22690, OAZIN, TRAPI , MGC42 174, MGC9850, KIAAI632, 
HSU53209, BIVM, BAALC. WHSC1, CI6orf5, KIAA1238. MRS2L, CGI-105. ZDHHC2, LOCI43903, DKFZp762N06IO, 

75 NSE1, OSBPL7. HAVCR2, ASAHL. KIAA1798. TLR4, MGC10946, PRexl, FU3I340. TAHCCPI, C20orfl4|, FU203I3, 
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TAF9L, FRSB, PRKRA, P66, KJAA0141, RARA, BANP, FU00007, DTNBPl, LRP5, KIAA1337, MGC29667, WHSC1. 
MMP28, EVIN2, Cab45,CED-6, PTER, ZNFN2A1, NDP52. CHES1, KIAA1635, NFAT5, FLJ32332, HTRA3, MAP4K1. 
K1AA1337, AP1S2, FU23306, HP1-BP74, KJAA1218, BTBD4, DKFZp761F01 18, MGC16703, BAZ2B, MU, FU13614, 
MYOl 5B, OAZIN, LOC92799, CANX. SUFU, KIAA1954, AGS3, LAPTM4A, HP1-BP74, FU23467, FU 12892, MGC40042, 
5 K.IAA1 143, RPL1 1 , LSR7, CENPJ, NY-REN-58, NRM, FU23563, WASF2, AM BP, N1P30, EIF2AK4, MGC15429, TTC7LI, 
NICN 1 , FXC1 , FU20793, SOC, RPL1 3, HYPC. CLONE24945, MGC24663, TEM7R, FU 14768, DKFZp667M24l 1 , STARD9, 
FOXP1, ELP3, KJAA1337.CDA017, PPP6C, PAK1, FUJI 0876, EPC1, ZNF397, C21orf63, KIAA1805, MIR, CYYRl, 
DKPZp564B0769, EPSTII, MDM4, MGC23947, MGC14421, SDCCAG33, DKFZp762O076, LOC93109, STN2, HSMPP8, 
FU20265, LOC85028, MGC15435, 1-Scp. MGC41917, MS!2, Jade-1. IL17D, MGC2752, MATR3, PRKRA, 

10 DKPZp434C1714, MGC44I5, DKFZP727C091 , MY038, FLJ35453. FU30794, DJ462023.2. FU90130, FU22283, EEF2, 
LOC155066, ATPAF1, FU23499, STAM2, LOC85028, FU21709, LOC51279. TRA@, JAM3, SIAT6, KIAA1453, E1F2S3, 
LSR7, ROCK1, DKFZP566I1024, FANCD2, MEF-2, MGC2664, MGC15548, ZNF75A, HSPC126, EIF3S5, RBM7, FU20280, 
GSTA4, SEPP1, T1GD3, DKFZP434 A 1319, MCLC, MGC14136, DKFZP762N2316, LOCI 15330, D4ST-1, UCP4, PRMT6, 
LAK, NFN, FU10997, RAB4B, LM04. RRN3, CENPH, FU23277, GBTS1. FU90013, LOCI 15509, PP2135, FLJ36175, 

1 5 SPINO, PAIP2, DKFZp761G0122, ATF7IP, WBPI, MGC29937, MGC9564, CASP2, TIGD7, C4S-2, MGC25181, LOC89887, 
KIAA1387, FU22283. GIT2, MIR, SSBP3, LOC159090. U5-200KD. FU10997. ZNF295, PGBD1, HEL308, POLH, AP3M1, 
NORE1 , SEMA6D, PPID, CUL5, LOC91663, FU1317I, BAT4, RPLP1, K1AA1630, CT2, HSPC182, HMGB1, FU20280, 
FKBP5, EIF3S6, CI 5orfl5, TRPC7, FU3I153, TA-1CRP, MGC17919, AP2A 1 , C20orfl 32, SECP43, PPIL2, FU 14494, YARS, 
MGC10974, CLN6, C20orf8l, U2AFI, K1AA1238, FU23861, LOC144455, DKFZp564Dl 77, NIP30. TBC1 Dl, ZNF265, and 

20 PPP4R2. 



Table 8 

Preferred Solid Tumor Stem Cell Cancer Markers 

25 Bmi-1, eed, easyhl, easyh2, mC, yyl, smarcA3, smarckA5, smarcD3, smarcEl, 
mllt3, frizzled 2, frizzled 6, frizzled 7, mf2, Frizzled 1, Frizzled2, Frizzled4, 
FrizzledlO, Frizzled6, FZD1, FZD2, FZD3, FZD4, FZD6, FZD7, FZD8, FZD9, 
FZD10, WNT2, WNT2B, WNT3, WNT5A, WNT10B, WNT16, AXIN1, BCL9, 
MYC, (TCF4), , SLC7A8, IL1RAP, TEM8, TMPRSS4, MUC16, GPRC5B, 

30 SLC6A14, SLC4A11, PPAP2C, CAV1, CAV2, PTPN3, EPHA1, SLC1 Al, CX3CL1, 
ADORA2A, MPZL1, FLJ10052, C4.4A, EDG3, RARRES1, TMEPA1, PTS, 
CEACAM6, , NID2, STEAP, ABCA3, CR1M1, IL1R1, OPN3, DAF, MUC1, MCP, 
CPD, NMA, ADAM9, GJA1, CD14, SLC19A2, ABCA1, PCDH7, ADCY9, 
SLC39A1, NPC1, ENPP1, N33, GPNMB, LY6E, CELSR1, LRP3, C20orf52, 

35 TMEPAI, FLVCR, PCDHA10, GPR54, TGFBR3, SEMA4B, and PCDHB2. 



Additional solid tumor stem cells cancer markers can be identified, for example, using 
the methods described in Example 4 below. 



40 IV. Detection of Solid Tumor Stem Cell Cancer Markers 

In some embodiments, the present invention provides methods for detection of 
expression of stem cell cancer markers (e.g., breast cancer stem cell cancer markers). 
In preferred embodiments, expression is measured directly (e.g., at the RNA or 
protein level). In some embodiments, expression is detected in tissue samples (e.g., 
45 biopsy tissue). In other embodiments, expression is detected in bodily fluids (e.g., 
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including but not limited to, plasma, serum, whole blood, mucus, and urine). The 
present invention further provides panels and kits for the detection of markers. In 
preferred embodiments, the presence of a stem cell cancer marker is used to provide a 
prognosis to a subject. The information provided is also used to direct the course of 

5 treatment. For example, if a subject is found to have a marker indicative of a solid 
tumor stem cell (see, e.g. Tables 4-8), additional therapies (e.g., hormonal or radiation 
therapies) can be started at a earlier point when they are more likely to be effective 
(e.g., before metastasis). In addition, if a subject is found to have a tumor that is not 
responsive to hormonal therapy, the expense and inconvenience of such therapies can 

1 0 be avoided. 

The present invention is not limited to the markers described above. Any 
suitable marker that correlates with cancer or the progression of cancer may be 
utilized. Additional markers are also contemplated to be within the scope of the 
present invention. Any suitable method may be utilized to identify and characterize 

15 cancer markers suitable for use in the methods of the present invention, including but 
not limited to, those described in illustrative Example 4 below. For example, in some 
embodiments, markers identified as being up or down-regulated in solid tumor stem 
cells using the gene expression microarray methods of the present invention are t 
further characterized using tissue microarray, immunohistochemistry, Northern blot 

20 analysis, siRNA or antisense RNA inhibition, mutation analysis, investigation of 
expression with clinical outcome, as well as other methods disclosed herein. 

In some embodiments, the present invention provides a panel for the analysis 
of a plurality of markers. The panel allows for the simultaneous analysis of multiple 
markers correlating with carcinogenesis and/or metastasis. Depending on the subject, 

25 panels may be analyzed alone or in combination in order to provide the best possible 
diagnosis and prognosis. Markers for inclusion on a panel are selected by screening 
for their predictive value using any suitable method, including but not limited to, 
those described in the illustrative examples below. 

30 1 . Detection of RNA 

In some preferred embodiments, detection of solid tumor stem cell cancer 
markers (e.g., including but not limited to, those disclosed in Tables 4-8) are detected 
by measuring the expression of corresponding mRNA in a tissue sample (e.g., breast 



52-- 



BNSDOC1D: <WO 200500560 1A2_I_> 



WO 2005/005601 



PCT/US2004/018266 



cancer tissue). mRNA expression may be measured by any suitable method, 
including but not limited to, those disclosed below. 

In some embodiments, RNA is detection by Northern blot analysis. Northern 
blot analysis involves the separation of RNA and hybridization of a complementary 

5 labeled probe. 

In still further embodiments, RNA (or corresponding cDNA) is detected by 
hybridization to a oligonucleotide probe). A variety of hybridization assays using a 
variety of technologies for hybridization and detection are available. For example, in 
some embodiments, TaqMan assay (PE Biosystems, Foster City, CA; See e.g., U.S. 

1 0 Patent Nos. 5,962,233 and 5,538,848, each of which is herein incorporated by 

reference) is utilized. The assay is performed during a PCR reaction. The TaqMan 
assay exploits the 5'-3' exonuclease activity of the AMPLITAQ GOLD DNA 
polymerase. A probe consisting of an oligonucleotide with a 5 f -reporter dye (e.g., a 
fluorescent dye) and a 3'-quencher dye is included in the PCR reaction. During PCR, 

1 5 if the probe is bound to its target, the 5'-3' nucleolytic activity of the AMPLITAQ 

GOLD polymerase cleaves the probe between the reporter and the quencher dye. The 
separation of the reporter dye from the quencher dye results in an increase of 
fluorescence. The signal accumulates with each cycle of PCR and can be monitored 
with a fluorimeter. 

20 In yet other embodiments, reverse-transcriptase PCR (RT-PCR) is used to 

detect the expression of RNA. In RT-PCR, RNA is enzymatically converted to 
complementary DNA or "cDNA" using a reverse transcriptase enzyme. The cDNA is 
then used as a template for a PCR reaction. PCR products can be detected by any 
suitable method, including but not limited to, gel electrophoresis and staining with a 

25 DNA specific stain or hybridization to a labeled probe. In some embodiments, the 
quantitative reverse transcriptase PCR with standardized mixtures of competitive 
templates method described in U.S. Patents 5,639,606, 5,643,765, and 5,876,978 
(each of which is herein incorporated by reference) is utilized. 

30 2. Detection of Protein 

In other embodiments, gene expression of stem cell cancer markers is detected 
by measuring the expression of the corresponding protein or polypeptide. Protein 
expression may be detected by any suitable method. In some embodiments, proteins 
are detected by immunohistochemistry. In other embodiments, proteins are detected 
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by their binding to an antibody raised against the protein. The generation of 
antibodies is described below. 

Antibody binding is detected by techniques known in the art (e.g. , 
radloimmunoaiiay, BLISA (enzyma-linked ittmtundiorbattt auay). "landwioh" 

5 immunoassays, immunoradiometric assays, gel diffusion precipitation reactions, 

immunodiffusion assays, in situ immunoassays (e.g., using colloidal gold, enzyme or 
radioisotope labels, for example), Western blots, precipitation reactions, agglutination 
assays (e.g., gel agglutination assays, hemagglutination assays, etc.), complement 
fixation assays, immunofluorescence assays, protein A assays, and 

10 Immunoelectrophoresis assays, etc. 

In one embodiment, antibody binding is detected by detecting a label on the 
primary antibody. In another embodiment, the primary antibody is detected by 
detecting binding of a secondary antibody or reagent to the primary antibody. In a 
further embodiment, the secondary antibody is labeled. Many methods are known in 

15 the art for detecting binding in an immunoassay and are within the scope of the 
present invention. 

In some embodiments, an automated detection assay is utilized. Methods for 
the automation of immunoassays include those described in U.S. Patents 5,885,530, 
4,98 1 ,785, 6, 1 59,750, and 5,358,691 , each of which is herein incorporated by 
20 reference. In some embodiments, the analysis and presentation of results is also 

automated. For example, in some embodiments, software that generates a prognosis 
based on the presence or absence of a series of proteins corresponding to cancer 
markers is utilized. 

In other embodiments, the immunoassay described in U.S. Patents 5,599,677 

25 and 

5,672,480; each of which is herein incorporated by reference. 
3. Data Analysis 

In some embodiments, a computer-based analysis program is used to translate 
30 the raw data generated by the detection assay (e.g., the presence, absence, or amount 
of a given marker or markers) into data of predictive value for a clinician. The 
clinician can access the predictive data using any suitable means. Thus, in some 
preferred embodiments, the present invention provides the further benefit that the 
clinician, who is not likely to be trained in genetics or molecular biology, need not 
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understand the raw data. The data is presented directly to the clinician in its most 
useful form. The clinician is then able to immediately utilize the information in order 
to optimize the care of the subject. 

The present invention contemplates any method capable of receiving, 
5 processing, and transmitting the information to and from laboratories conducting the 
assays, information provides, medical personal, and subjects. For example, in some 
embodiments of the present invention, a sample (e.g., a biopsy or a serum or urine 
sample) is obtained from a subject and submitted to a profiling service (e.g., clinical 
lab at a medical facility, genomic profiling business, etc.), located in any part of the 

10 world (e.g., in a country different than the country where the subject resides or where 
the information is ultimately used) to generate raw data. Where the sample comprises 
a tissue or other biological sample, the subject may visit a medical center to have the 
sample obtained and sent to the profiling center, or subjects may collect the sample 
themselves and directly send it to a profiling center. Where the sample comprises 

15 previously determined biological information, the information may be directly sent to 
the profiling service by the subject (e.g., an information card containing the 
information may be scanned by a computer and the data transmitted to a computer of 
the profiling center using an electronic communication systems). Once received by 
the profiling service, the sample is processed and a profile is produced (e.g., 

20 expression data), specific for the diagnostic or prognostic information desired for the 
subject. 

The profile data is then prepared in a format suitable for interpretation by a 
treating clinician. For example, rather than providing raw expression data (e.g. 
examining a number of the markers described in Tables 4-8), the prepared format may 

25 represent a diagnosis or risk assessment for the subject, along with recommendations 
for particular treatment options. The data may be displayed to the clinician by any 
suitable method. For example, in some embodiments, the profiling service generates 
a report that can be printed for the clinician (e.g., at the point of care) or displayed to 
the clinician on a computer monitor. 

30 In some embodiments, the information is first analyzed at the point of care or 

at a regional facility. The raw data is then sent to a central processing facility for 
further analysis and/or to convert the raw data to information useful for a clinician or 
patient. The central processing facility provides the advantage of privacy (all data is 
stored in a central facility with uniform security protocols), speed, and uniformity of 
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data analysis. The central processing facility can then control the fate of the data 
following treatment ef the §ubject. Far example, using an electronic communication 

system, the central facility can provide data to the clinician, the subject, or 
researchers. 

5 In some embodiments, the subject is able to directly access the data using the 

electronic communication system. The subject may chose further intervention or 
counseling based on the results. In some embodiments, the data is used for research 
use. For example, the data may be used to further optimize the inclusion or 
elimination of markers as useful indicators of a particular condition or stage of 
10 disease. 



4. Kits 

In yet other embodiments, the present invention provides kits for the detection 
and characterization of cancer (e.g. for detecting one or more of the markers shown in 

15 Tables 4-8, or for modulating the activity of a peptide expressed by one or more of 
markes shown in Tables 4-8). In some embodiments, the kits contain antibodies 
specific for a cancer marker, in addition to detection reagents and buffers. In other 
embodiments, the kits contain reagents specific for the detection of mRNA or cDNA 
(e.g., oligonucleotide probes or primers). In preferred embodiments, the kits contain 

20 all of the components necessary to perform a detection assay, including all controls, 
directions for performing assays, and any necessary software for analysis and 
presentation of results. 



5. In vivo Imaging 

25 In some embodiments, in vivo imaging techniques are used to visualize the 

expression of cancer markers in an animal (e.g., a human or non-human mammal). 
For example, in some embodiments, cancer marker mRNA or protein is labeled using 
an labeled antibody specific for the cancer marker. A specifically bound and labeled 
antibody can be detected in an individual using an in vivo imaging method, including, 

30 but not limited to, radionuclide imaging, positron emission tomography, computerized 
axial tomography, X-ray or magnetic resonance imaging method, fluorescence 
detection, and chemiluminescent detection. Methods for generating antibodies to the 
cancer markers of the present invention are described below. 
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The in vivo imaging methods of the present invention are useful in the 
diagnosis of cancers that express the solid tumor stem cell cancer markers of the 
present invention (e.g., in breast cancer). In vivo imaging is used to visualize the 
presence of a marker indicative of the cancer. Such techniques allow for diagnosis 

5 without the use of an unpleasant biopsy. The in vivo imaging methods of the present 
invention are also useful for providing prognoses to cancer patients. For example, the 
presence of a marker indicative of cancer stem cells can be detected. The in vivo 
imaging methods of the present invention can further be used to detect metastatic 
cancers in other parts of the body. 

0 In some embodiments, reagents (e.g., antibodies) specific for the cancer 

markers of the present invention are fluorescently labeled. The labeled antibodies are 
introduced into a subject (e.g., orally or parenterally). Fluorescently labeled 
antibodies are detected using any suitable method (e.g., using the apparatus described 
in U.S. Patent 6,198,107, herein incorporated by reference). 

5 In other embodiments, antibodies are radioactively labeled. The use of 

antibodies for in vivo diagnosis is well known in the art. Sumerdon et al., (Nucl. 
Med. Biol 17:247-254 [1990] have described an optimized antibody-chelator for the 
radioimmunoscintographic imaging of tumors using Indium- 1 1 1 as the label. Griffin 
et al, (J Clin One 9:631-640 [1991]) have described the use of this agent in detecting 

0 tumors in patients suspected of having recurrent colorectal cancer. The use of similar 
agents with paramagnetic ions as labels for magnetic resonance imaging is known in 
the art (Lauffer, Magnetic Resonance in Medicine 22:339-342 [1991]). The label 
used will depend on the imaging modality chosen. Radioactive labels such as Indium- 
111, Technetium-99m, or Iodine-131 can be used for planar scans or single photon 

5 emission computed tomography (SPECT). Positron emitting labels such as Fluorine- 
19 can also be used for positron emission tomography (PET). For MRI, paramagnetic 
ions such as Gadolinium (HI) or Manganese (II) can be used. 

Radioactive metals with half-lives ranging from 1 hour to 3.5 days are 
available for conjugation to antibodies, such as scandium-47 (3.5 days) gallium-67 

0 (2.8 days), gallium-68 (68 minutes), technetiium-99m (6 hours), and indium-1 1 1 (3.2 
days), of which gallium-67, technetium-99m, and indium-1 1 1 are preferable for 
gamma camera imaging, gallium-68 is preferable for positron emission tomography. 

A useful method of labeling antibodies with such radiometals is by means of a 
Afunctional chelating agent, such as diethylenetriaminepentaacetic acid (DTP A), as 
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described, for example, by Khaw et al. (Science 209:295 [1980]) for In- 11 1 and Tc- 
99m, and by Scheinberg et al (Science 215:1511 [1982]). Other chelating agents 
may also be used, but the 1 -(p-carboxymethoxybenzyl)EDTA and the 
carboxycarbonic anhydride of DTP A are advantageous because their use permits 
5 conjugation without affecting the antibody's immunoreactivity substantially. 

Another method for coupling DPTA to proteins is by use of the cyclic 
anhydride of DTP A, as described by Hnatowich et al. (Int. J. Appl. Radiat. Isot. 
33:327 [1982]) for labeling of albumin with In- 111, but which can be adapted for 
labeling of antibodies. A suitable method of labeling antibodies with Tc-99m which 

10 does not use chelation with DPTA is the pretinning method of Crockford et al., (U.S. 
Pat. No. 4,323,546, herein incorporated by reference). 

A preferred method of labeling immunoglobulins with Tc-99m is that 
described by Wong et al (Int. J. Appl. Radiat. Isot., 29:251 [1978]) for plasma 
protein, and recently applied successfully by Wong et al. (J. Nucl. Med., 23:229 

15 [ 1 98 1 ]) for labeling antibodies. 

In the case of the radiometals conjugated to the specific antibody, it is likewise 
desirable to introduce as high a proportion of the radiolabel as possible into the 
antibody molecule without destroying its immunospecificity. A further improvement 
may be achieved by effecting radiolabeling in the presence of the specific stem cell 

20 cancer marker of the present invention, to insure that the antigen binding site on the 
antibody will be protected. 

In still further embodiments, in vivo biophotonic imaging (Xenogen, Almeda, 
CA) is utilized for in vivo imaging. This real-time in vivo imaging utilizes luciferase. 
The luciferase gene is incorporated into cells, microorganisms, and animals (e.g., as a 
25 fusion protein with a cancer marker of the present invention). When active, it leads to 
a reaction that emits light. A CCD camera and software is used to capture the image 
and analyze it. 

V. Antibodies and Antibody Fragments 
30 The present invention provides isolated antibodies and antibody fragments 

(e.g, Fabs). In preferred embodiments, the present invention provides monoclonal 
antibodies or antibody fragments that specifically bind to an isolated polypeptide 
comprised of at least five, or at least 15 amino acid residues of the stem cell cancer 
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markers described herein (e.g., as shown in Tables 4-8). These antibodies or antibody 
fragments find use in the diagnostic, drug screening, and therapetuic methods 
described herein (e.g. to detect or modulate the activity of a stem cell cancer marker 
peptide). 

5 An antibody, or antibody fragment, against a protein of the present invention 

may be any monoclonal or polyclonal antibody, as long as it can recognize the 
protein. Antibodies can be produced by using a protein of the present invention as the 
antigen according to a conventional antibody or antiserum preparation process. 

The present invention contemplates the use of both monoclonal and polyclonal 

10 antibodies. Any suitable method may be used to generate the antibodies used in the 
methods and compositions of the present invention, including but not limited to, those 
disclosed herein. For example, for preparation of a monoclonal antibody, protein, as 
such, or together with a suitable carrier or diluent is administered to an animal (e.g., a 
mammal) under conditions that permit the production of antibodies. For enhancing 

1 5 the antibody production capability, complete or incomplete Freund's adjuvant may be 
administered. Normally, the protein is administered once every 2 weeks to 6 weeks, 
in total, about 2 times to about 10 times. Animals suitable for use in such methods 
include, but are not limited to, primates, rabbits, dogs, guinea pigs, mice, rats, sheep, 
goats, etc. 

20 For preparing monoclonal antibody-producing cells, an individual animal 

whose antibody titer has been confirmed (e.g., a mouse) is selected, and 2 days to 5 
days after the final immunization, its spleen or lymph node is harvested and 
antibody-producing cells contained therein are fused with myeloma cells to prepare 
the desired monoclonal antibody producer hybridoma. Measurement of the antibody 

25 titer in antiserum can be carried out, for example, by reacting the labeled protein, as 
described hereinafter and antiserum and then measuring the activity of the labeling 
agent bound to the antibody. The cell fusion can be carried out according to known 
methods, for example, the method described by Koehler and Milstein (Nature 256:495 
[1975]). As a fusion promoter, for example, polyethylene glycol (PEG) or Sendai 

30 virus (HVJ), preferably PEG is used. 

Examples of myeloma cells include NS-1, P3U1, SP2/0, AP-1 and the like. 
The proportion of the number of antibody producer cells (spleen cells) and the 
number of myeloma cells to be used is preferably about 1 : 1 to about 20; 1 . PEG 
(preferably PEG 1000-PEG 6000) is preferably added in concentration of about 10% 
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to about 80%. Cell ftision can be earned out efficiently by incubating a mixture of 

both cells at about 20°C to about 40°C, preferably about 30°C to about 37°C for about 
1 minute to 10 minutes. 

Various methods may be used for screening for a hybridoma producing the 

5 antibody (e.g., against a tumor antigen or autoantibody of the present invention). For 
example, where a supernatant of the hybridoma is added to a solid phase (e.g., 
microplate) to which antibody is adsorbed directly or together with a carrier and then 
an anti-immunoglobulin antibody (if mouse cells are used in cell ftision, anti -mouse 
immunoglobulin antibody is used) or Protein A labeled with a radioactive substance 

10 or an enzyme is added to detect the monoclonal antibody against the protein bound to 
the solid phase. Alternately, a supernatant of the hybridoma is added to a solid phase 
to which an anti-immunoglobulin antibody or Protein A is adsorbed and then the 
protein labeled with a radioactive substance or an enzyme is added to detect the 
monoclonal antibody against the protein bound to the solid phase. 

15 Selection of the monoclonal antibody can be carried out according to any 

known method or its modification. Normally, a medium for animal cells to which 
HAT (hypoxanthine, aminopterin, thymidine) are added is employed. Any selection 
and growth medium can be employed as long as the hybridoma can grow. For 
example, RPMI 1640 medium containing 1% to 20%, preferably 10% to 20% fetal 

20 bovine serum, GIT medium containing 1% to 10% fetal bovine serum, a serum free 

medium for cultivation of a hybridoma (SFM-101, Nissui Seiyaku) and the like can be 
used. Normally, the cultivation is carried out at 20°C to 40°C, preferably 37°C for 
about 5 days to 3 weeks, preferably 1 week to 2 weeks under about 5% CO2 gas. The 
antibody titer of the supernatant of a hybridoma culture can be measured according to 

25 the same manner as described above with respect to the antibody titer of the 
anti-protein in the antiserum. 

Separation and purification of a monoclonal antibody (e.g., against a cancer 
marker of the present invention) can be carried out according to the same manner as 
those of conventional polyclonal antibodies such as separation and purification of 

30 immunoglobulins, for example, salting-out, alcoholic precipitation, isoelectric point 
precipitation, electrophoresis, adsorption and desorption with ion exchangers (eg., 
DEAE), ultracentrifiigation, gel filtration, or a specific purification method wherein 
only an antibody is collected with an active adsorbent such as an antigen-binding 
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solid phase, Protein A or Protein G and dissociating the binding to obtain the 
antibody. 

Polyclonal antibodies may be prepared by any known method or modifications 
of these methods including obtaining antibodies from patients. For example, a 
5 complex of an immunogen (an antigen against the protein) and a carrier protein is 
prepared and an animal is immunized by the complex according to the same manner 
as that described with respect to the above monoclonal antibody preparation. A 
material containing the antibody against is recovered from the immunized animal and 
the antibody is separated and purified. 

10 As to the complex of the immunogen and the carrier protein to be used for 

immunization of an animal, any carrier protein and any mixing proportion of the 
carrier and a hapten can be employed as long as an antibody against the hapten, which 
is crosslinked on the carrier and used for immunization, is produced efficiently. For 
example, bovine serum albumin, bovine cycloglobulin, keyhole limpet hemocyanin, 

15 etc. may be coupled to an hapten in a weight ratio of about 0.1 part to about 20 parts, 
preferably, about 1 part to about 5 parts per 1 part of the hapten. 

In addition, various condensing agents can be used for coupling of a hapten 
and a carrier. For example, glutaraldehyde, carbodiimide, maleimide activated ester, 
activated ester reagents containing thiol group or dithiopyridyl group, and the like 

20 find use with the present invention. The condensation product as such or together 

with a suitable carrier or diluent is administered to a site of an animal that permits the 
antibody production. For enhancing the antibody production capability, complete or 
incomplete Freund's adjuvant may be administered. Normally, the protein is 
administered once every 2 weeks to 6 weeks, in total, about 3 times to about 10 times. 

25 The polyclonal antibody is recovered from blood, ascites and the like, of an 

animal immunized by the above method. The antibody titer in the antiserum can be 
measured according to the same manner as that described above with respect to the 
supernatant of the hybridoma culture. Separation and purification of the antibody can 
be carried out according to the same separation and purification method of 

30 immunoglobulin as that described with respect to the above monoclonal antibody. 

The protein used herein as the immunogen is not limited to any particular type 
of immunogen. For example, a stem cell cancer marker of the present invention 
(further including a gene having a nucleotide sequence partly altered) can be used as 
the immunogen. Further, fragments of the protein may be used. Fragments may be 
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obtained by any methods including, but not limited to expressing a fragment of the 
gene, enzymatic processing of the protein, chemical synthesis, and the like. The 
antibodies and antibody fragments may also be conjugated to therapeutic (e.g. cancer 
cell killing compounds). In this regard, the antibody directed toward one of the stem 
5 cell cancer markers is used to specifically deliver a therapeutic agent to a solid tumor 
cancer cell (e.g. to inhibit the proliferation of such sell or kill such a cell). 

VI. Drug Screening 

In some embodiments, the present invention provides drug screening assays 
10 (e.g., to screen for anticancer drugs). The screening methods of the present invention 
utilize stem cell cancer markers identified using the methods of the present invention 
(e.g., including but not limited to, the stem cell cancer markers shown in Tables 4-8). 
For example, in some embodiments, the present invention provides methods of 
screening for compound that alter (e.g., increase or decrease) the expression of stem 

1 5 cell cancer marker genes. In some embodiments, candidate compounds are antisense 

i, 

agents or siRNA agents (eg., oligonucleotides) directed against cancer markers. In 
other embodiments, candidate compounds are antibodies that specifically bind to a 
stem cell cancer marker of the present invention. In certain embodiments, libraries of 
compounds of small molecules are screened using the methods described herein. 

20 In one screening method, candidate compounds are evaluated for their ability 

to alter stem cell cancer marker expression by contacting a compound with a cell 
expressing a stem cell cancer marker and then assaying for the effect of the candidate 
compounds on expression. In some embodiments, the effect of candidate compounds 
on expression of a cancer marker gene is assayed by detecting the level of cancer 

25 marker mRNA expressed by the cell. mRNA expression can be detected by any 
suitable method. In other embodiments, the effect of candidate compounds on 
expression of cancer marker genes is assayed by measuring the level of polypeptide 
encoded by the cancer markers. The level of polypeptide expressed can be measured 
using any suitable method, including but not limited to, those disclosed herein. In 

30 some embodiments, other changes in eel biology (e.g., apoptosis) are detected. 

Specifically, the present invention provides screening methods for identifying 
modulators, i.e., candidate or test compounds or agents (e.g., proteins, peptides, 
peptidomimetics, peptoids, small molecules or other drugs) which bind to, or alter the 
signalling or function associated with the cancer markers of the present invention, 

62 

BNSDOCID: <WO 2005005601 A2_l_> 



WO 2005/005601 



PCTYUS2004/018266 



have an inhibitory (or stimulatory) effect on, for example, stem cell cancer marker 
expression or cancer markers activity, or have a stimulatory or inhibitory effect on, 
for example, the expression or activity of a cancer marker substrate. Compounds thus 
identified can be used to modulate the activity of target gene products (e.g., stem cell 
5 cancer marker genes) either directly or indirectly in a therapeutic protocol, to 

elaborate the biological function of the target gene product, or to identify compounds 
that disrupt normal target gene interactions. Compounds which inhibit the activity or 
expression of cancer markers are useful in the treatment of proliferative disorders, 
e.g., cancer, particularly metastatic cancer or eliminating or controlling tumor stem 

1 0 cells to prevent or reduce the risk of cancer. 

In one embodiment, the invention provides assays for screening candidate or 
test compounds that are substrates of a cancer markers protein or polypeptide or a 
biologically active portion thereof. In another embodiment, the invention provides 
assays for screening candidate or test compounds that bind to or modulate the activity 

15 of a cancer marker protein or polypeptide or a biologically active portion thereof. 

The test compounds of the present invention can be obtained using any of the 
numerous approaches in combinatorial library methods known in the art, including 
biological libraries; peptoid libraries (libraries of molecules having the functionalities 
of peptides, but with a novel, non-peptide backbone, which are resistant to enzymatic 

20 degradation but which nevertheless remain bioactive; see, e.g., Zuckennann et ah, 
J. Med. Chem. 37: 2678-85 [1994]); spatially addressable parallel solid phase or 
solution phase libraries; synthetic library methods requiring deconvolution; the 'one- 
bead one-compound 1 library method; and synthetic library methods using affinity 
chromatography selection. The biological library and peptoid library approaches are 

25 preferred for use with peptide libraries, while the other four approaches are applicable 
to peptide, non-peptide oligomer or small molecule libraries of compounds (Lam 
(1997) Anticancer Drug Des. 12:145). 

Examples of methods for the synthesis of molecular libraries can be found in 
the art, for example in: DeWitt et ah, Proc. Natl Acad. Sci. U.S.A. 90:6909 [1993]; 

30 Erb et ah, Proc. Nad. Acad. Sci. USA 91 : 1 1422 [1994]; Zuckermann et ah, J. Med. 
Chem. 37:2678 [1994]; Cho et ah, Science 261:1303 [1993]; Carrell et ah, Angew. 
Chem. Int. Ed. Engl. 33.2059 [1994]; Carell et ah, Angew. Chem. Int. Ed. Engl. 
33:2061 [1994]; and Gallop et ah, J. Med. Chem. 37:1233 [1994]. 
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Libraries of compounds may be presented in solution (e.g., Hdtighten, 

Biateehnlquea 13:412.421 [1M2]), or on baadi (Lam, Katiira 3 54s 12-84 [1001 J), 

chips (Fodor, Nature 364:555-556 [1993]), bacteria or spores (U.S. Patent No. 
5,223,409; herein incorporated by reference), plasmids (Cull et al, Proc. Nad. Acad. 
5 Sci. USA 89:18651869 [1992]) or on phage (Scott and Smith, Science 249:386-390 
[1990]; Devlin Science 249:404-406 [1990]; Cwirla et aL, Proc. Natl. Acad. Sci. 
87:6378-6382 [1990]; Felici, J. Mol. Biol. 222:301 [1991]). 

In one embodiment, an assay is a cell-based assay in which a cell that 
expresses a stem cell cancer marker protein or biologically active portion thereof is 
10 contacted with a test compound, and the ability of the test compound to the modulate 
cancer marker's activity is determined. Determining the ability of the test compound 
to modulate stem cell cancer marker activity can be accomplished by monitoring, for 
example, changes in enzymatic activity. The cell, for example, can be of mammalian 
origin. 

1 5 The ability of the test compound to modulate cancer marker binding to a 

compound, e.g., a stem cell cancer marker substrate, can also be evaluated. This can 
be accomplished, for example, by coupling the compound, e.g., the substrate, with a 
radioisotope or enzymatic label such that binding of the compound, e.g., the substrate, 
to a cancer marker can be determined by detecting the labeled compound, e.g.,^ 

20 substrate, in a complex. 

Alternatively, the stem cell cancer marker is coupled with a radioisotope or 
enzymatic label to monitor the ability of a test compound to modulate cancer marker 
binding to a cancer markers substrate in a complex. For example, compounds (e.g., 
substrates) can be labeled with 125 1, 35 S 14 C or 3 H, either directly or indirectly, and the 

25 radioisotope detected by direct counting of radioemmission or by scintillation 

counting. Alternatively, compounds can be enzymatically labeled with, for example, 
horseradish peroxidase, alkaline phosphatase, or luciferase, and the enzymatic label 
detected by determination of conversion of an appropriate substrate to product. 
The ability of a compound (e.g., a stem cell cancer marker substrate) to 

30 interact with a stem cell cancer marker with or without the labeling of any of the 

interactants can be evaluated. For example, a microphysiorneter can be used to detect 
the interaction of a compound with a cancer marker without the labeling of either the 
compound or the cancer marker (McConnell et al. Science 257:1906-1912 [1992]). 
As used herein, a "microphysiorneter" (e.g., Cytosensor) is an analytical instrument 
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that measures the rate at which a cell acidifies its environment using a light- 
addressable potentiometric sensor (LAPS). Changes in this acidification rate can be 
used as an indicator of the interaction between a compound and cancer markers. 

In yet another embodiment, a cell-free assay is provided in which a cancer 
5 marker protein or biologically active portion thereof is contacted with a test 

compound and the ability of the test compound to bind to the stem cell cancer marker 
protein or biologically active portion thereof is evaluated. Preferred biologically 
active portions of the cancer markers proteins to be used in assays of the present 
invention include fragments that participate in interactions with substrates or other 
1 0 proteins, e.g., fragments with high surface probability scores. 

Cell-free assays involve preparing a reaction mixture of the target gene protein 
and the test compound under conditions and for a time sufficient to allow the two 
components to interact and bind, thus forming a complex that can be removed and/or 
detected. 

1 5 The interaction between two molecules can also be detected, e.g., using 

fluorescence energy transfer (FRET) (see, for example, Lakowicz et al. 9 U.S. Patent 
No. 5,631,169; Stavrianopoulos et al, U.S. Patent No. 4,968,103; each of which is 
herein incorporated by reference). A fluorophore label is selected such that a first 
donor molecule's emitted fluorescent energy will be absorbed by a fluorescent label 

20 on a second, 'acceptor* molecule, which in turn is able to fluoresce due to the absorbed 
energy. 

Alternately, the 'donor 1 protein molecule may simply utilize the natural 
fluorescent energy of tryptophan residues. Labels are chosen that emit different 
wavelengths of light, such that the 'acceptor 1 molecule label may be differentiated 

25 from that of the 'donor'. Since the efficiency of energy transfer between the labels is 
related to the distance separating the molecules, the spatial relationship between the 
molecules can be assessed. In a situation in which binding occurs between the 
molecules, the fluorescent emission of the 'acceptor' molecule label in 1 5 the assay 
should be maximal. An FRET binding event can be conveniently measured through 

30 standard fluorometric detection means well known in the art (e.g., using a 
fluorimeter). 

In another embodiment, determining the ability of the stem cell cancer 
markers protein to bind to a target molecule can be accomplished using real-time 
Biomolecular Interaction Analysis (BIA) (see, e.g., Sjolander and Urbaniczky, Anal. 
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Chem. 63:2338-2345 [1991] and Szabo etal Curr. Opin. Struct. Biol. 5:699-705 
[1993]). "Surfae plftimon re«onnnce M or "BIA" detect! bloipeoifle interactions in 

real time, without labeling any of the interactants (e.g., BlAcore). Changes in the 
mass at the binding surfeee (indicative of a binding event) result in alterations of the 

5 refractive index of light near the surface (the optical phenomenon of surface plasmon 
resonance (SPR)), resulting in a detectable signal that can be used as an indication of 
real-time reactions between biological molecules. 

In one embodiment, the target gene product or the test substance is anchored 
onto a solid phase. The target gene product/test compound complexes anchored on 
10 the solid phase can be detected at the end of the reaction. Preferably, the target gene 
product can be anchored onto a solid surface, and the test compound, (which is not 
anchored), can be labeled, either directly or indirectly, with detectable labels 
discussed herein. 

It may be desirable to immobilize stem cell cancer markers, an anti-cancer 

15 marker antibody or its target molecule to facilitate separation of complexed from non- 
complexed forms of one or both of the proteins, as well as to accommodate 
automation of the assay. Binding of a test compound to a stem cell cancer marker 
protein, or interaction of a cancer marker protein with a target molecule in the 
presence and absence of a candidate compound, can be accomplished in any vessel 

20 suitable for containing the reactants. Examples of such vessels include microtiter 
plates, test tubes, and micro-centrifuge tubes. In one embodiment, a fusion protein 
can be provided which adds a domain that allows one or both of the proteins to be 
bound to a matrix. For example, glutathione-S-transferase-cancer marker fusion 
proteins or glutathione-S-transferase/target fusion proteins can be adsorbed onto 

25 glutathione Sepharose beads (Sigma Chemical, St. Louis, MO) or glutathione- 

derivatized microtiter plates, which are then combined with the test compound or the 
test compound and either the non-adsorbed target protein or cancer marker protein, 
and the mixture incubated under conditions conducive for complex formation (e.g., at 
physiological conditions for salt and pH). Following incubation, the beads or 

30 microtiter plate wells are washed to remove any unbound components, the matrix 

immobilized in the case of beads, complex determined either directly or indirectly, for 
example, as described above. 

Alternatively, the complexes can be dissociated from the matrix, and the level 
of cancer markers binding or activity determined using standard techniques. Other 
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techniques for immobilizing either cancer markers protein or a target molecule on 
matrices include using conjugation of biotin and streptavidin. Biotinylated cancer 
marker protein or target molecules can be prepared from biotin-NHS (N-hydroxy- 
succinimide) using techniques known in the art (e.g., biotinylation kit, Pierce 

5 Chemicals, Rockford, EL), and immobilized in the wells of streptavidin-coated 96 
well plates (Pierce Chemical). 

In order to conduct the assay, the non-immobilized component is added to the 
coated surface containing the anchored component. After the reaction is complete, 
unreacted components are removed (e.g., by washing) under conditions such that any 

10 complexes formed will remain immobilized on the solid surface. The detection of 
complexes anchored on the solid surface can be accomplished in a number of ways. 
Where the previously non-immobilized component is pre-labeled, the detection of 
label immobilized on the surface indicates that complexes were formed. Where the 
previously non-immobilized component is not pre-labeled, an indirect label can be 

15 used to detect complexes anchored on the surface; e.g., using a labeled antibody 

specific for the immobilized component (the antibody, in turn, can be directly labeled 
or indirectly labeled with, e.g., a labeled anti-IgG antibody). 

This assay is performed utilizing antibodies reactive with stem cell cancer 
marker protein or target molecules but which do not interfere with binding of the stem 

20 cell cancer markers protein to its target molecule. Such antibodies can be derivatized 
to the wells of the plate, and unbound target or cancer markers protein trapped in the 
wells by antibody conjugation. Methods for detecting such complexes, in addition to 
those described above for the GST-immobilized complexes, include immunodetection 
of complexes using antibodies reactive with the cancer marker protein or target 

25 molecule, as well as enzyme-linked assays which rely on detecting an enzymatic 
activity associated with the cancer marker protein or target molecule. 

Alternatively, cell free assays can be conducted in a liquid phase. In such an 
assay, the reaction products are separated from unreacted components, by any of a 
number of standard techniques, including, but not limited to: differential 

30 centrifugation (see, for example, Rivas and Minton, Trends Biochem Sci 18:284-7 
[1993]); chromatography (gel filtration chromatography, ion-exchange 
chromatography); electrophoresis (see, e.g., Ausubel etal, eds. Current Protocols in 
Molecular Biology 1999, J. Wiley: New York.); and immunoprecipitation (see, for 
example, Ausubel et al, eds. Current Protocols in Molecular Biology 1999, J. Wiley: 
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New York). Such resins and chromatographic techniques are known to one skilled in 
the art (See e.g., Heegaard J. Mol. Recognit 11:141-8 [1998]; Hageand Tweed J. 
Chromatogr. Biomed. Sci. Appl 699:499-525 [1997]). Further, fluorescence energy 
transfer may also be conveniently utilized, as described herein, to detect binding 
5 without further purification of the complex from solution. 

The assay can include contacting the stem cell cancer markers protein or 
biologically active portion thereof with a known compound that binds the cancer 
marker to form an assay mixture, contacting the assay mixture with a test compound, 
and determining the ability of the test compound to interact with a cancer marker 

10 protein, wherein determining the ability of the test compound to interact with a cancer 
marker protein includes determining the ability of the test compound to preferentially 
bind to cancer markers or biologically active portion thereof, or to modulate the 
activity of a target molecule, as compared to the known compound. 

To the extent that stem cell cancer markers can, in vivo, interact with one or 

15 more cellular or extracellular macromolecules, such as proteins, inhibitors of such an 
interaction are useful. A homogeneous assay can be used can be used to identify 
inhibitors. 

For example, a preformed complex of the target gene product and the 
interactive cellular or extracellular binding partner product is prepared such that either 

20 the target gene products or their binding partners are labeled, but the signal generated 
by the label is quenched due to complex formation (see, e.g., U.S. Patent No. 
4,109,496, herein incorporated by reference, that utilizes this approach for 
immunoassays). The addition of a test substance that competes with and displaces 
one of the species from the preformed complex will result in the generation of a signal 

25 above background. In this way, test substances that disrupt target gene product- 
binding partner interaction can be identified. Alternatively, cancer markers protein 
can be used as a "bait protein" in a two-hybrid assay or three-hybrid assay (see, e.g., 
U.S. Patent No. 5,283,317; Zervos et al, Cell 72:223-232 [1993]; Madura et al, J. 
Biol. Chem. 268.12046-12054 [1993]; Bartel et al, Biotechniques 14:920-924 [1993]; 

30 Iwabuchi et al, Oncogene 8: 1693-1696 [1993]; and Brent W0 94/10300; each of 
which is herein incorporated by reference), to identify other proteins, that bind to or 
interact with cancer markers ("cancer marker-binding proteins" or "cancer marker- 
bp") and are involved in cancer marker activity. Such cancer marker-bps can be 
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activators or inhibitors of signals by the cancer marker proteins or targets as, for 
example, downstream elements of a cancer markers-mediated signaling pathway. 

Modulators of cancer markers expression can also be identified. For example, 
a cell or cell free mixture is contacted with a candidate compound and the expression 
5 of cancer marker mRNA or protein evaluated relative to the level of expression of 
stem cell cancer marker mRNA or protein in the absence of the candidate compound. 
When expression of cancer marker mRNA or protein is greater in the presence of the 
candidate compound than in its absence, the candidate compound is identified as a 
stimulator of cancer marker mRNA or protein expression. Alternatively, when 

1 0 expression of cancer marker mRNA or protein is less (i.e., statistically significantly 
less) in the presence of the candidate compound than in its absence, the candidate 
compound is identified as an inhibitor of cancer marker mRNA or protein expression. 
The level of cancer markers mRNA or protein expression can be determined by 
methods described herein for detecting cancer markers mRNA or protein. 

1 5 A modulating agent can be identified using a cell-based or a cell free assay, 

and the ability of the agent to modulate the activity of a cancer markers protein can be 
confirmed in vivo, e.g., in an animal such as an animal model for a disease (e.g., an 
animal with prostate cancer or metastatic prostate cancer; or an animal harboring a 
xenograft of a prostate cancer from an animal (e.g., human) or cells from a cancer 

20 resulting from metastasis of a prostate cancer (e.g., to a lymph node, bone, or liver), 
or cells from a prostate cancer cell line. 

This invention further pertains to novel agents identified by the above- 
described screening assays (See e.g., below description of cancer therapies). 
Accordingly, it is within the scope of this invention to further use an agent identified 

25 as described herein (e.g., a cancer marker modulating agent, an antisense cancer 

marker nucleic acid molecule, a siRNA molecule, a cancer marker specific antibody, 
or a cancer marker-binding partner) in an appropriate animal model (such as those 
described herein) to determine the efficacy, toxicity, side effects, or mechanism of 
action, of treatment with such an agent. Furthermore, novel agents identified by the 

30 above-described screening assays can be, e.g., used for treatments as described herein 
(e.g. to treat a human patient who has cancer). 
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VII. Cancer Therapies 

In some embodiments, the present invention provides therapies for cancer 
(e.g., breast cancer). In some embodiments, therapies target cancer markers (e.g., 
including but not limited to, those shown in Tables 4-8). 

5 

A. Antisense Therapies 

Candidate therapeutic agents also find use in drug screening and research 
applications. In some embodiments, the present invention targets the expression of 
stem cell cancer markers. For example, in some embodiments, the present invention 

10 employs compositions comprising oligomeric antisense compounds, particularly 
oligonucleotides (e.g., those identified in the drug screening methods described 
above), for use in modulating the function of nucleic acid molecules encoding stem 
cell cancer markers of the present invention, ultimately modulating the amount of 
cancer marker expressed. This is accomplished by providing antisense compounds 

15 that specifically hybridize with one or more nucleic acids encoding cancer markers of 
the present invention. The specific hybridization of an oligomeric compound with its 
target nucleic acid interferes with the normal function of the nucleic acid. This 
modulation of function of a target nucleic acid by compounds that specifically, 
hybridize to it is generally referred to as "antisense." The functions of DNA tq;be 

20 interfered with include replication and transcription. The functions of RNA to be 
interfered with include all vital functions such as, for example, translocation of the 
RNA to the site of protein translation, translation of protein from the RNA, splicing of 
the RNA to yield one or more mRNA species, and catalytic activity that may be 
engaged in or facilitated by the RNA. The overall effect of such interference with 

25 target nucleic acid function is modulation of the expression of cancer markers of the 
present invention. In the context of the present invention, "modulation" means either 
an increase (stimulation) or a decrease (inhibition) in the expression of a gene. For 
example, expression may be inhibited to potentially prevent tumor proliferation. 

It is preferred to target specific nucleic acids for antisense. "Targeting" an 

30 antisense compound to a particular nucleic acid, in the context of the present 

invention, is a multistep process. The process usually begins with the identification of 
a nucleic acid sequence whose function is to be modulated. This may be, for 
example, a cellular gene (or mRNA transcribed from the gene) whose expression is 
associated with a particular disorder or disease state, or a nucleic acid molecule from 
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an infectious agent. In the present invention, the target is a nucleic acid molecule 
encoding a stem cell cancer marker of the present invention. The targeting process 
also includes determination of a site or sites within this gene for the antisense 
interaction to occur such that the desired effect, e.g., detection or modulation of 
5 expression of the protein, will result. Within the context of the present invention, a 
preferred intragenic site is the region encompassing the translation initiation or 
termination codon of the open reading frame (ORF) of the gene. Since the translation 
initiation codon is typically 5 f -AUG (in transcribed mRNA molecules; 5'-ATG in the 
corresponding DNA molecule), the translation initiation codon is also referred to as 

1 0 the "AUG codon," the "start codon" or the "AUG start codon". A minority of genes 
have a translation initiation codon having the RNA sequence 5'-GUG, 5-UUG or 
5'-CUG, and 5*-AUA, 5-ACG and 5-CUG have been shown to function in vivo. 
Thus, the terms "translation initiation codon" and "start codon" can encompass many 
codon sequences, even though the initiator amino acid in each instance is typically 

1 5 methionine (in eukaryotes) or formylmethionine (in prokaryotes). Eukaryotic and 
prokaryotic genes may have two or more alternative start codons, any one of which 
may be preferentially utilized for translation initiation in a particular cell type or 
tissue, or under a particular set of conditions. In the context of the present invention, 
"start codon" and "translation initiation codon" refer to the codon or codons that are 

20 used in vivo to initiate translation of an mRNA molecule transcribed from a gene 
encoding a tumor antigen of the present invention, regardless of the sequence(s) of 
such codons. 

Translation termination codon (or "stop codon") of a gene may have one of 
three sequences (i.e., 5 f -UAA, 5-UAG and 5'-UGA; the corresponding DNA 

25 sequences are 5'-TAA, 5'-TAG and 5'-TGA, respectively). The terms "start codon 
region" and "translation initiation codon region" refer to a portion of such an mRNA 
or gene that encompasses from about 25 to about 50 contiguous nucleotides in either 
direction (i.e., 5' or 3') from a translation initiation codon. Similarly, the terms "stop 
codon region" and "translation termination codon region" refer to a portion of such an 

30 mRNA or gene that encompasses from about 25 to about 50 contiguous nucleotides in 
either direction (i.e., 5' or 3') from a translation termination codon. 

The open reading frame (ORF) or "coding region," which refers to the region 
between the translation initiation codon and the translation termination codon, is also 
a region that may be targeted effectively. Other target regions include the 5 1 
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untranslated region (5 f UTR), referring to the portion of an mRNA in the 5* direction 
from the translation initiation codon, and thus including nucleotides between the 5' 
cap site and the translation initiation codon of an mRNA or corresponding nucleotides 
on the gene, and the 3' untranslated region (3 f UTR), referring to the portion of an 
5 mRNA in the 3* direction from the translation termination codon, and thus including 
nucleotides between the translation termination codon and 3' end of an mRNA or 
corresponding nucleotides on the gene. The 5' cap of an mRNA comprises an 
N7-methylated guanosine residue joined to the 5'-most residue of the mRNA via a 
5-5' triphosphate linkage. The 5* cap region of an mRNA is considered to include the 

10 5* cap structure itself as well as the first 50 nucleotides adjacent to the cap. The cap 
region may also be a preferred target region. 

Although some eukaryotic mRNA transcripts are directly translated, many 
contain one or more regions, known as "introns," that are excised from a transcript 
before it is translated. The remaining (and therefore translated) regions are known as 

15 "exons" and are spliced together to form a continuous mRNA sequence. mRNA 

splice sites (i.e., intron-exon junctions) may also be preferred target regions, and are 
particularly useful in situations where aberrant splicing is implicated in disease, or 
where an overproduction of a particular mRNA splice product is implicated in 
disease. Aberrant fusion junctions due to rearrangements or deletions are also 

20 preferred targets. It has also been found that introns can also be effective, and 

therefore preferred, target regions for antisense compounds targeted, for example, to 
DNA or pre-mRNA. 

In some embodiments, target sites for antisense inhibition are identified using 
commercially available software programs (e.g., Biognostik, Gottingen, Germany; 

25 SysArris Software, Bangalore, India; Antisense Research Group, University of 

Liverpool, Liverpool, England; GeneTrove, Carlsbad, CA). In other embodiments, 
target sites for antisense inhibition are identified using the accessible site method 
described in U.S. Patent WO0198537A2, herein incorporated by reference. 

Once one or more target sites have been identified, oligonucleotides are 

30 chosen that are sufficiently complementary to the target (i.e., hybridize sufficiently 
well and with sufficient specificity) to give the desired effect. For example, in 
preferred embodiments of the present invention, antisense oligonucleotides are 
targeted to or near the start codon. 
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In the context of this invention, "hybridization," with respect to antisense 
compositions and methods, means hydrogen bonding, which may be Watson-Crick, 
Hoogsteen or reversed Hoogsteen hydrogen bonding, between complementary 
nucleoside or nucleotide bases. For example, adenine and thymine are 
5 complementary nucleobases that pair through the formation of hydrogen bonds. It is 
understood that the sequence of an antisense compound need not be 100% 
complementary to that of its target nucleic acid to be specifically hybridizable. An 
antisense compound is specifically hybridizable when binding of the compound to the 
target DN A or RNA molecule interferes with the normal function of the target DNA 

1 0 or RNA to cause a loss of utility, and there is a sufficient degree of complementarity 
to avoid non-specific binding of the antisense compound to non-target sequences 
under conditions in which specific binding is desired under physiological 
conditions in the case of in vivo assays or therapeutic treatment, and in the case of in 
vitro assays, under conditions in which the assays are performed). 

15 Antisense compounds are commonly used as research reagents and 

diagnostics. For example, antisense oligonucleotides, which are able to inhibit gene 
expression with specificity, can be used to elucidate the function of particular genes. 
Antisense compounds are also used, for example, to distinguish between functions of 
various members of a biological pathway. 

20 The specificity and sensitivity of antisense is also applied for therapeutic uses. 

For example, antisense oligonucleotides have been employed as therapeutic moieties 
in the treatment of disease states in animals and man. Antisense oligonucleotides 
have been safely and effectively administered to humans and numerous clinical trials 
are presently underway. It is thus established that oligonucleotides are useful 

25 therapeutic modalities that can be configured to be useful in treatment regimes for 
treatment of cells, tissues, and animals, especially humans. 

While antisense oligonucleotides are a preferred form of antisense compound, 
the present invention comprehends other oligomeric antisense compounds, including 
but not limited to oligonucleotide mimetics such as are described below. The 

30 antisense compounds in accordance with this invention preferably comprise from 

about 8 to about 30 nucleobases (i.e., from about 8 to about 30 linked bases), although 
both longer and shorter sequences may find use with the present invention. 
Particularly preferred antisense compounds are antisense oligonucleotides, even more 
preferably those comprising from about 12 to about 25 nucleobases. 
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Specific examples of preferred antisense compounds useful with the present 
invention include oligonucleotides containing modified backbones or non-natural 

internucleoside linkages. As defined in this specification, oligonucleotides having 
modified backbones include those that retain a phosphorus atom in the backbone and 
5 those that do not have a phosphorus atom in the backbone. For the purposes of this 
specification, modified oligonucleotides that do not have a phosphorus atom in their 
internucleoside backbone can also be considered to be oligonucleosides. 

Preferred modified oligonucleotide backbones include, for example, 
phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters, 

10 aminoalkylphosphotriesters, methyl and other alkyl phosphonates including 

3'-alkylene phosphonates and chiral phosphonates, phosphinates, phosphoramidates 
including 3*-amino phosphoramidate and aminoalkylphosphoramidates, 
thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, and 
boranophosphates having normal 3'-5' linkages, 2-5* linked analogs of these, and 

1 5 those having inverted polarity wherein the adjacent pairs of nucleoside units are 

linked 3'-5' to 5'-3' or 2-5' to 5'-2\ Various salts, mixed salts and free acid forms are 
also included. 

Preferred modified oligonucleotide backbones that do not include a 
phosphorus atom therein have backbones that are formed by short chain alkyl or 

20 cycloalkyl internucleoside linkages, mixed heteroatom and alkyl or cycloalkyl 

internucleoside linkages, or one or more short chain heteroatomic or heterocyclic 
internucleoside linkages. These include those having morpholino linkages (formed in 
part from the sugar portion of a nucleoside); siloxane backbones; sulfide, sulfoxide 
and sulfone backbones; formacetyl and thioformacetyl backbones; methylene 

25 formacetyl and thioformacetyl backbones; alkene containing backbones; sulfamate 
backbones; methyleneimino and methylenehydrazino backbones; sulfonate and 
sulfonamide backbones; amide backbones; and others having mixed N, O, S and CH2 
component parts. 

In other preferred oligonucleotide mimetics, both the sugar and the 

30 internucleoside linkage (i.e., the backbone) of the nucleotide units are replaced with 
novel groups. The base units are maintained for hybridization with an appropriate 
nucleic acid target compound. One such oligomeric compound, an oligonucleotide 
mimetic that has been shown to have excellent hybridization properties, is referred to 
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as a peptide nucleic acid (PNA). In PNA compounds, the sugar-backbone of an 
oligonucleotide is replaced with an amide containing backbone, in particular an 
aminoethylglycine backbone. The nucleobases are retained and are bound directly or 
indirectly to aza nitrogen atoms of the amide portion of the backbone. Representative 
5 United States patents that teach the preparation of PNA compounds include, but are 
not limited to, U.S. Pat. Nos.: 5,539,082; 5,714,331; and 5,719,262, each of which is 
herein incorporated by reference. Further teaching of PNA compounds can be found 
in Nielsen et aL, Science 254:1497 (1991). 

Most preferred embodiments of the invention are oligonucleotides with 

1 0 phosphorothioate backbones and oligonucleosides with heteroatom backbones, and in 
particular --CH2, --NH--O--CH2--, --CH 2 --N(CH 3 )--0--CH2~ [known as a 
methylene (methylimino) or MMI backbone], --CH2--0--N(CH 3 )--CH2~, 
--CH 2 --N(CH 3 )--N(CH 3 )--CH 2 --, and ~0--N(CH 3 )--CH 2 --CH 2 -- [wherein the 
native phosphodiester backbone is represented as — 0--P-- O— CH2— ] of the above 

1 5 referenced U.S. Pat. No. 5,489,677, and the amide backbones of the above referenced 
U.S. Pat. No. 5,602,240. Also preferred are oligonucleotides having morpholino 
backbone structures of the above-referenced U.S. Pat. No. 5,034,506. 

Modified oligonucleotides may also contain one or more substituted sugar 
moieties. Preferred oligonucleotides comprise one of the following at the T position: 

20 OH; F; O, S-, or N-alkyl; O-, S-, or N-alkenyl; O-, S- or N-alkynyl; or 

O-alkyl-O-alkyl, wherein the alkyl, alkenyl and alkynyl may be substituted or 
unsubstituted C] to C10 alkyl or C2 to C\q alkenyl and alkynyl. Particularly 

preferred are 0[(CH 2 )nO]mCH 3 , 0(CH 2 ) n OCH 3 , 0(CH 2 )nNH2, 0(CH 2 )nCH 3 , 
0(CH 2 ) n ONH 2 , and 0(CH 2 )nON[(CH2) n CH 3 )]2, where n and m are from 1 to 
25 about 10. Other preferred oligonucleotides comprise one of the following at the T 

position: C] to C\q lower alkyl, substituted lower alkyl, alkaryl, aralkyl, O-alkaryl or 

O-aralkyl, SH, SCH 3 , OCN, CI, Br, CN, CF 3 , OCF 3 , SOCH 3 , S0 2 CH 3 , ON0 2 , 
NO2, N 3 , NH2, heterocycloalkyl, heterocycloalkaryl, aminoalkylamino, 
polyalkylamino, substituted silyl, an RNA cleaving group, a reporter group, an 
30 intercalator, a group for improving the pharmacokinetic properties of an 

oligonucleotide, or a group for improving the pharmacodynamic properties of an 
oligonucleotide, and other substituents having similar properties. A preferred 
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modification includes 2'-methoxyethoxy (2'-0--CH2CH20CH3, also known as 

2'-0-(2-methoxyethyl) or 2'-MOE) (Martin et al % Helv. Chim. Acta 78:486 [1995]) 

1. e., an alkoxyalkoxy group. A further preferred modification includes 

2 , -dimethylaminooxyethoxy (i.e f a 0(CH2>20N(CH3)2 group), also known as 
5 2-DMAOE, and 2 , -dimethylaminoethoxyethoxy (also known in the art as 

2 , -0-dimethylaminoethoxyethyl or 2'-DMAEOE), i.e, 
2 , -0-CH 2 -0-CH2-N(CH 2 )2. 

Other preferred modifications include 2'-methoxy(2'-0--CH3), 
2 , -aminopropoxy(2 , -OCH2CH2CH2NH2) and 2 , -fluoro (2'-F). Similar modifications 

10 may also be made at other positions on the oligonucleotide, particularly the 3' position 
of the sugar on the 3' terminal nucleotide or in 2'-5' linked oligonucleotides and the 5' 
position of 5' terminal nucleotide. Oligonucleotides may also have sugar mimetics 
such as cyclobutyl moieties in place of the pentofuranosyl sugar. 

Oligonucleotides may also include nucleobase (often referred to in the art 

15 simply as "base") modifications or substitutions. As used herein, "unmodified" or 
"natural" nucleobases include the purine bases adenine (A) and guanine (G), and the 
pyrimidine bases thymine (T), cytosine (C) and uracil (U). Modified nucleobases 
include other synthetic and natural nucleobases such as 5-methylcytosine (5-me-C), 
5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 6-methyl and 

20 other alkyl derivatives of adenine and guanine, 2-propyl and other alkyl derivatives of 
adenine and guanine, 2-thiouracil, 2-thiothymine and 2-thiocytosine, 5-halouracil and 
cytosine, 5-propynyl uracil and cytosine, 6-azo uracil, cytosine and thymine, 5 -uracil 
(pseudouracil), 4-thiouracil, 8-halo, 8-amino, 8-thiol, 8-thioalkyl, 8-hydroxyl and 
other 8-substituted adenines and guanines, 5-halo particularly 5-bromo, 

25 5-trifluoromethyl and other 5-substituted uracils and cytosines, 7-methylguanine and 
7-methyladenine, 8-azaguanine and 8-azaadenine, 7-deazaguanine and 
7-deazaadenine and 3-deazaguanine and 3-deazaadenine. Further nucleobases include 
those disclosed in U.S. Pat. No. 3,687,808. Certain of these nucleobases are 
particularly useful for increasing the binding affinity of the oligomeric compounds of 

30 the invention. These include 5-substituted pyrimidines, 6-azapyrimidines and N-2, 
N-6 and 0-6 substituted purines, including 2-aminopropyladenine, 5-propynyluracil 
and 5-propynylcytosine. 5-methylcytosine substitutions have been shown to increase 
nucleic acid duplex stability by 0.6-1 .2. degree °C and are presently preferred base 
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substitutions, even more particularly when combined with 2 , -0-methoxyethyl sugar 
modifications. 

Another modification of the oligonucleotides of the present invention involves 
chemically linking to the oligonucleotide one or more moieties or conjugates that 
5 enhance the activity, cellular distribution or cellular uptake of the oligonucleotide. 
Such moieties include but are not limited to lipid moieties such as a cholesterol 
moiety, cholic acid, a thioether, (e.g., hexyl-S-tritylthiol), a thiocholesterol, an 
aliphatic chain, (e.g., dodecandiol or undecyl residues), a phospholipid, (e.g., 
di-hexadecyl-rac-glycerol or triethylammonium 

10 1 ,2-di-0-hexadecyl-rac-glycero-3-H-phosphonate), a polyamine or a polyethylene 
glycol chain or adamantane acetic acid, a palmityl moiety, or an octadecylamine or 
hexylamino-carbonyl-oxycholesterol moiety. 

One skilled in the relevant art knows well how to generate oligonucleotides 
containing the above-described modifications. The present invention is not limited to 

1 5 the antisensce oligonucleotides described above. Any suitable modification or 
substitution may be utilized. 

It is not necessary for all positions in a given compound to be uniformly 
modified, and in fact more than one of the aforementioned modifications may be 
incorporated in a single compound or even at a single nucleoside within an 

20 oligonucleotide. The present invention also includes antisense compounds that are 
chimeric compounds. "Chimeric" antisense compounds or "chimeras," in the context 
of the present invention, are antisense compounds, particularly oligonucleotides, 
which contain two or more chemically distinct regions, each made up of at least one 
monomer unit, i.e., a nucleotide in the case of an oligonucleotide compound. These 

25 oligonucleotides typically contain at least one region wherein the oligonucleotide is 
modified so as to confer upon the oligonucleotide increased resistance to nuclease 
degradation, increased cellular uptake, and/or increased binding affinity for the target 
nucleic acid. An additional region of the oligonucleotide may serve as a substrate for 
enzymes capable of cleaving RNA:DNA or RNA:RNA hybrids. By way of example, 

30 RNaseH is a cellular endonuclease that cleaves the RNA strand of an RNA:DNA 
duplex. Activation of RNase H, therefore, results in cleavage of the RNA target, 
thereby greatly enhancing the efficiency of oligonucleotide inhibition of gene 
expression. Consequently, comparable results can often be obtained with shorter 
oligonucleotides when chimeric oligonucleotides are used, compared to 
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phosphorothioate deoxyoligonucleotides hybridizing to the same target region. 
Cleavage of the RNA target can be routinely detected by gel electrophoresis and, if 
necessary, associated nucleic acid hybridization techniques known in the art. 

Chimeric antisense compounds of the present invention may be formed as 
5 composite structures of two or more oligonucleotides, modified oligonucleotides, 
oligonucleosides and/or oligonucleotide mimetics as described above. 

The present invention also includes pharmaceutical compositions and 
formulations that include the antisense compounds of the present invention as 
described below. 

10 

B. Genetic Therapies 

The present invention contemplates the use of any genetic manipulation for 
use in modulating the expression of stem cell cancer markers of the present invention. 
Examples of genetic manipulation include, but are not limited to, gene knockout (e.g., 

1 5 removing the cancer marker gene from the chromosome using, for example, 
recombination), expression of antisense constructs with or without inducible 
promoters, addition of a heterologous gene (e.g. controlled by an inducible promoter), 
and the like. Delivery of nucleic acid construct to cells in vitro or in vivo may be 
conducted using any suitable method. A suitable method is one that introduces the 

20 nucleic acid construct into the cell such that the desired event occurs (e.g., expression 
of an antisense construct). 

Introduction of molecules carrying genetic information into cells is achieved 
by any of various methods including, but not limited to, directed injection of naked 
DNA constructs, bombardment with gold particles loaded with said constructs, and 

25 macromolecule mediated gene transfer using, for example, liposomes, biopolymers, 
and the like. Preferred methods use gene delivery vehicles derived from viruses, 
including, but not limited to, adenoviruses, retroviruses, vaccinia viruses, and adeno- 
associated viruses. Because of the higher efficiency as compared to retroviruses, 
vectors derived from adenoviruses are the preferred gene delivery vehicles for 

30 transferring nucleic acid molecules into host cells in vivo. Adenoviral vectors have 
been shown to provide very efficient in vivo gene transfer into a variety of solid 
tumors in animal models and into human solid tumor xenografts in immune-deficient 
mice. Examples of adenoviral vectors and methods for gene transfer are described in 
PCT publications WO 00/12738 and WO 00/09675 and U.S. Pat. Appl. Nos. 
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6,033,908, 6,019,978, 6,001,557, 5,994,132, 5,994,128, 5,994,106, 5,981,225, 
5,885,808, 5,872,154, 5,830,730, and 5,824,544, each of which is herein incorporated 
by reference in its entirety. 

Vectors may be administered to a subject in a variety of ways. For example, 
5 in some embodiments of the present invention, vectors are administered into tumors 
or tissue associated with tumors using direct injection. In other embodiments, 
administration is via the blood or lymphatic circulation (See e.g., PCT publication 
99/02685 herein incorporated by reference in its entirety). Exemplary dose levels of 
adenoviral vector are preferably 10** to 10* * vector particles added to the perfusate. 

0 

C. Antibody Therapy 

In some embodiments, the present invention provides antibodies that target 
tumors that express a stem cell cancer marker of the present invention (e.g., those 
shown in Tables 4-8). Any suitable antibody (e.g., monoclonal, polyclonal, or 
5 synthetic) may be utilized in the therapeutic methods disclosed herein. In preferred 
embodiments, the antibodies used for cancer therapy are humanized antibodies. 
Methods for humanizing antibodies are well known in the art (See e.g., U.S. Patents 
6,180,370, 5,585,089, 6,054,297, and 5,565,332; each of which is herein incorporated 
by reference). 

0 In some embodiments, the therapeutic antibodies comprise an antibody 

generated against a stem cell cancer marker of the present invention, wherein the 
antibody is conjugated to a cytotoxic agent. In such embodiments, a tumor specific 
therapeutic agent is generated that does not target normal cells, thus reducing many of 
the detrimental side effects of traditional chemotherapy. For certain applications, it is 

5 envisioned that the therapeutic agents will be pharmacologic agents that will serve as 
useful agents for attachment to antibodies, particularly cytotoxic or otherwise 
anticellular agents having the ability to kill or suppress the growth or cell division of 
endothelial cells. The present invention contemplates the use of any pharmacologic 
agent that can be conjugated to an antibody, and delivered in active form. Exemplary 

0 anticellular agents include chemotherapeutic agents, radioisotopes, and cytotoxins. 
The therapeutic antibodies of the present invention may include a variety of cytotoxic 
moieties, including but not limited to, radioactive isotopes (e.g., iodine-131, iodine- 
123, technicium-99m, indium-Ill, rhenium-188, rhenium-186, gallium-67, copper- 
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67, yttrium-90, iodine- 125 or astatine-21 1), hormones such as a steroid, 

antimetabolites such as cytosines (e.g. 9 arabinoside, fluorouracil, methotrexate or 

aminopterin; an anthracycline; mitomycin C), vinca alkaloids (e.g., demecolcine; 
etoposide; mithramycin), and antitumor alkylating agent such as chlorambucil or 
5 melphalan. Other embodiments may include agents such as a coagulant, a cytokine, 
growth factor, bacterial endotoxin or the lipid A moiety of bacterial endotoxin. For 
example, in some embodiments, therapeutic agents will include plant-, fungus- or 
bacteria-derived toxin, such as an A chain toxins, a ribosome inactivating protein, a - 
sarcin, aspergillin, restrictocin, a ribonuclease, diphtheria toxin or pseudomonas 

10 exotoxin, to mention just a few examples. In some preferred embodiments, 
deglycosylated ricin A chain is utilized. 

In any event, it is proposed that agents such as these may, if desired, be 
successfully conjugated to an antibody, in a manner that will allow their targeting, 
internalization, release or presentation to blood components at the site of the targeted 

15 tumor cells as required using known conjugation technology (See, e.g., Ghose et al., 
Methods Enzymol., 93:280 [1983]). 

For example, in some embodiments the present invention provides 
immunotoxins targeted a stem cell cancer marker of the present invention. 
Immunotoxins are conjugates of a specific targeting agent typically a tumor-directed 

20 antibody or fragment, with a cytotoxic agent, such as a toxin moiety. The targeting 
agent directs the toxin to, and thereby selectively kills, cells carrying the targeted 
antigen. In some embodiments, therapeutic antibodies employ crosslinkers that 
provide high in vivo stability (Thorpe et al. y Cancer Res., 48:6396 [1988]). 

In other embodiments, particularly those involving treatment of solid tumors, 

25 antibodies are designed to have a cytotoxic or otherwise anticellular effect against the 
tumor vasculature, by suppressing the growth or cell division of the vascular 
endothelial cells. This attack is intended to lead to a tumor-localized vascular 
collapse, depriving the tumor cells, particularly those tumor cells distal of the 
vasculature, of oxygen and nutrients, ultimately leading to cell death and tumor 

30 necrosis. 

In preferred embodiments, antibody based therapeutics are formulated as 
pharmaceutical compositions as described below. In preferred embodiments, 
administration of an antibody composition of the present invention results in a 
measurable decrease in cancer (e.g., decrease or elimination of tumor). 
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D. RNAi Therapies 

In other embodiments, RNAi is used to regulate expression of the stem cell 
cancer markers of the present invention (e.g. those shown in Tables 4-8). RNAi 
represents an evolutionary conserved cellular defense for controlling the expression of 
5 foreign genes in most eukaryotes, including humans. RNAi is triggered by double- 
stranded RNA (dsRNA) and causes sequence-specific mRNA degradation of single- 
stranded target RNAs homologous in response to dsRNA. The mediators of mRNA 
degradation are small interfering RNA duplexes (siRNAs), which are normally 
produced from long dsRNA by enzymatic cleavage in the cell. siRNAs are generally 

1 0 approximately twenty-one nucleotides in length (e.g. 21-23 nucleotides in length), and 
have a base-paired structure characterized by two nucleotide 3-overhangs. Following 
the introduction of a small RNA, or RNAi, into the cell, it is believed the sequence is 
delivered to an enzyme complex called RISC (RNA-induced silencing complex). 
RISC recognizes the target and cleaves it with an endonuclease. It is noted that if 

1 5 larger RNA sequences are delivered to a cell, RNase III enzyme (Dicer) converts 
longer dsRNA into 21-23 nt ds siRNA fragments. 

Chemically synthesized siRNAs have become powerful reagents for genome- 
wide analysis of mammalian gene function in cultured somatic cells. Beyond their 
value for validation of gene function, siRNAs also hold great potential as gene- 

20 specific therapeutic agents (Tuschl and Borkhardt, Molecular Intervent. 2002; 
2(3): 158-67, herein incorporated by reference). 

The transfection of siRNAs into animal cells results in the potent, long-lasting 
post-transcriptional silencing of specific genes (Caplen et al, Proc Natl Acad Sci 
U.S.A. 2001; 98: 9742-7; Elbashir et al., Nature. 2001; 41 1 :494-8; Elbashir et al., 

25 Genes Dev. 2001;15: 188-200; and Elbashir et al., EMBO J. 2001; 20: 6877-88, all 
of which are herein incorporated by reference). Methods and compositions for 
performing RNAi with siRNAs are described, for example, in U.S. Patent 6,506,559, 
herein incorporated by reference. 

siRNAs are extraordinarily effective at lowering the amounts of targeted 

30 RNA, and by extension proteins, frequently to undetectable levels. The silencing 

effect can last several months, and is extraordinarily specific, because one nucleotide 
mismatch between the target RNA and the central region of the siRNA is frequently 
sufficient to prevent silencing Brummelkamp et al, Science 2002; 296:550-3; and 
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Holen et al, Nucleic Acids Res. 2002; 30:1757-66, both of which are herein 
incorporated by reference. 

E. Pharmaceutical Compositions 

5 The present invention further provides pharmaceutical compositions (e.g., 

comprising a small molecule, antisent, antibody, or siRNA that targets the stem cell 
cancer markers of the present invention). The pharmaceutical compositions of the 
present invention may be administered in a number of ways depending upon whether 
local or systemic treatment is desired and upon the area to be treated. Administration 

10 may be topical (including ophthalmic and to mucous membranes including vaginal 
and rectal delivery), pulmonary (e.g., by inhalation or insufflation of powders or 
aerosols, including by nebulizer; intratracheal, intranasal, epidermal and transdermal), 
oral or parenteral. Parenteral administration includes intravenous, intraarterial, 
subcutaneous, intraperitoneal or intramuscular injection or infusion; or intracranial, 

1 5 e.g. , intrathecal or intraventricular, administration. 

Pharmaceutical compositions and formulations for topical administration may 
include transdermal patches, ointments, lotions, creams, gels, drops, suppositories, 
sprays, liquids and powders. Conventional pharmaceutical carriers, aqueous, powder 
or oily bases, thickeners and the like may be necessary or desirable. 

20 Compositions and formulations for oral administration include powders or 

granules, suspensions or solutions in water or non-aqueous media, capsules, sachets or 
tablets. Thickeners, flavoring agents, diluents, emulsifiers, dispersing aids or binders 
may be desirable. 

Compositions and formulations for parenteral, intrathecal or intraventricular 
25 administration may include sterile aqueous solutions that may also contain buffers, 

diluents and other suitable additives such as, but not limited to, penetration enhancers, 
carrier compounds and other pharmaceutical^ acceptable carriers or excipients. 

Pharmaceutical compositions of the present invention include, but are not 
limited to, solutions, emulsions, and liposome-containing formulations. These 
30 compositions may be generated from a variety of components that include, but are not 
limited to, preformed liquids, self-emulsifying solids and self-emulsifying semisolids. 

The pharmaceutical formulations of the present invention, which may 
conveniently be presented in unit dosage form, may be prepared according to 
conventional techniques well known in the pharmaceutical industry. Such techniques 
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include the step of bringing into association the active ingredients with the 
pharmaceutical carrier(s) or excipient(s). In general the formulations are prepared by 
uniformly and intimately bringing into association the active ingredients with liquid 
carriers or finely divided solid carriers or both, and then, if necessary, shaping the 
5 product. 

The compositions of the present invention may be formulated into any of 
many possible dosage forms such as, but not limited to, tablets, capsules, liquid 
syrups, soft gels, suppositories, and enemas. The compositions of the present 
invention may also be formulated as suspensions in aqueous, non-aqueous or mixed 

1 0 media. Aqueous suspensions may further contain substances that increase the 

viscosity of the suspension including, for example, sodium carboxymethylcellulose, 
sorbitol and/or dextran. The suspension may also contain stabilizers. 

In one embodiment of the present invention the pharmaceutical compositions 
may be formulated and used as foams. Pharmaceutical foams include formulations 

1 5 such as, but not limited to, emulsions, microemulsions, creams, jellies and liposomes. 
While basically similar in nature these formulations vary in the components and the 
consistency of the final product. 

Agents that enhance uptake of oligonucleotides at the cellular level may also 
be added to the pharmaceutical and other compositions of the present invention. For 

20 example, cationic lipids, such as lipofectin (U.S. Pat. No. 5,705,188), cationic 

glycerol derivatives, and polycationic molecules, such as polylysine (WO 97/30731), 
also enhance the cellular uptake of oligonucleotides. 

The compositions of the present invention may additionally contain other 
adjunct components conventionally found in pharmaceutical compositions. Thus, for 

25 example, the compositions may contain additional, compatible, 

pharmaceutically-active materials such as, for example, antipruritics, astringents, local 
anesthetics or anti-inflammatory agents, or may contain additional materials useful in 
physically formulating various dosage forms of the compositions of the present 
invention, such as dyes, flavoring agents, preservatives, antioxidants, opacifiers, 

30 thickening agents and stabilizers. However, such materials, when added, should not 
unduly interfere with the biological activities of the components of the compositions 
of the present invention. The formulations can be sterilized and, if desired, mixed with 
auxiliary agents, e.g., lubricants, preservatives, stabilizers, wetting agents, emulsifiers, 
salts for influencing osmotic pressure, buffers, colorings, flavorings and/or aromatic 
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substances and the like which do not deleteriously interact with the nucleic acid(s) of 
the formulation. 

Certain embodiments of the invention provide pharmaceutical compositions 
containing (a) one or more compounds that modulate the activity of a stem cell caner 

5 marker (e.g. antibody, small molecule, siRNA, anti-sense, etc.) and (b) one or more 
other chemotherapeutic agents. Examples of such chemotherapeutic agents include, 
but are not limited to, anticancer drugs such as daunorubicin, dactinomycin, 
doxorubicin, bleomycin, mitomycin, nitrogen mustard, chlorambucil, melphalan, 
cyclophosphamide, 6-mercaptopurine, 6-thioguanine, cytarabine (CA), 5-fluorouracil 

10 (5-FU), floxuridine (5-FUdR), methotrexate (MTX), colchicine, vincristine, 
vinblastine, etoposide, teniposide, cisplatin and diethylstilbestrol (DES). 
Anti-inflammatory drugs, including but not limited to nonsteroidal anti-inflammatory 
drugs and corticosteroids, and antiviral drugs, including but not limited to ribivirin, 
vidarabine, acyclovir and ganciclovir, may also be combined in compositions of the 

15 invention. Other chemotherapeutic agents are also within the scope of this invention. 
Two or more combined compounds may be used together or sequentially. 

Dosing is dependent on severity and responsiveness of the disease state to be 
treated, with the course of treatment lasting from several days to several months, or 
until a cure is effected or a diminution of the disease state is achieved (e.g.,reduction 

20 in tumor size). Optimal dosing schedules can be calculated from measurements of 
drug accumulation in the body of the patient. The administering physician can easily 
determine optimum dosages, dosing methodologies and repetition rates. Optimum 
dosages may vary depending on the relative potency of individual oligonucleotides, 
and can generally be estimated based on EC50S found to be effective in in vitro and in 

25 vivo animal models or based on the examples described herein. In general, dosage is 
from 0.01 jig to 100 g per kg of body weight, and may be given once or more daily, 
weekly, monthly or yearly. The treating physician can estimate repetition rates for 
dosing based on measured residence times and concentrations of the drug in bodily 
fluids or tissues. Following successful treatment, it may be desirable to have the 

30 subject undergo maintenance therapy to prevent the recurrence of the disease state, 
wherein the oligonucleotide is administered in maintenance doses, ranging from 0.01 
ug to 100 g per kg of body weight, once or more daily, to once every 20 years. 

84 



BNSOOCID: <WO 2005005601 A2 1 > 



WO 2005/005601 



PCT/US2004/018266 



VIII. Transgenic Animals Expressing Cancer Marker Genes 

The present invention contemplates the generation of transgenic animals 
comprising an exogenous cancer marker gene of the present invention or mutants and 
variants thereof (e.g., truncations or single nucleotide polymorphisms) or knock-outs 
5 thereof. In preferred embodiments, the transgenic animal displays an altered 

phenotype (e.g., increased or decreased presence of markers) as compared to wild- 
type animals. Methods for analyzing the presence or absence of such phenotypes 
include but are not limited to, those disclosed herein. In some preferred 
embodiments, the transgenic animals further display an increased or decreased growth 
10 of tumors or evidence of cancer. 

The transgenic animals of the present invention find use in drug (e.g., cancer 
therapy) screens. In some embodiments, test compounds (e.g., a drug that is 
suspected of being useful to treat cancer) and control compounds (e.g., a placebo) are 
administered to the transgenic animals and the control animals and the effects 
1 5 evaluated. 

The transgenic animals can be generated via a variety of methods. In some 
embodiments, embryonal cells at various developmental stages are used to introduce 
transgenes for the production of transgenic animals. Different methods are used 
depending on the stage of development of the embryonal cell. The zygote is the best 

20 target for micro-injection. In the mouse, the male pronucleus reaches the size of 
approximately 20 micrometers in diameter that allows reproducible injection of 1-2 
picoliters (pi) of DNA solution. The use of zygotes as a target for gene transfer has a 
major advantage in that in most cases the injected DNA will be incorporated into the 
host genome before the first cleavage (Brinster et al., Proc. Natl. Acad. Sci. USA 

25 82:4438-4442 [1985]). As a consequence, all cells of the transgenic non-human 

animal will carry the incorporated transgene. This will in general also be reflected in 
the efficient transmission of the transgene to offspring of the founder since 50% of the 
germ cells will harbor the transgene. U.S. Patent No. 4,873,191 describes a method 
for the micro-injection of zygotes; the disclosure of this patent is incorporated herein 

30 in its entirety. 

In other embodiments, retroviral infection is used to introduce transgenes into 
a non-human animal. In some embodiments, the retroviral vector is utilized to 
transfect oocytes by injecting the retroviral vector into the perivitelline space of the 
oocyte (U.S. Pat. No. 6,080,912, incorporated herein by reference). In other 
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embodiments, the developing non-human embryo can be cultured in vitro to the 
blastocyst stage. During this time, the blastomeres can be targets for retroviral 
infection (Janenich, Proc. Natl Acad. Sci. USA 73:1260 [1976]). Efficient infection 
of the blastomeres is obtained by enzymatic treatment to remove the zona pellucida 
5 (Hogan et al, in Manipulating the Mouse Embryo >, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, N.Y. [1986]). The viral vector system used to introduce 
the transgene is typically a replication-defective retrovirus carrying the transgene 
(Jahner et ai, Proc. Natl. Acad Sci. USA 82:6927 [1985]). Transfection is easily and 
efficiently obtained by culturing the blastomeres on a monolayer of virus-producing 

10 cells (Stewart, et al. EMBO J., 6:383 [1987]). 

Alternatively, infection can be performed at a later stage. Virus or virus- 
producing cells can be injected into the blastocoele (Jahner et al, Nature 298:623 
[1982]). Most of the founders will be mosaic for the transgene since incorporation 
occurs only in a subset of cells that form the transgenic animal. Further, the founder 

15 may contain various retroviral insertions of the transgene at different positions in the 
genome that generally will segregate in the offspring. In addition, it is also possible 
to introduce transgenes into the germline, albeit with low efficiency, by intrauterine 
retroviral infection of the midgestation embryo (Jahner et al., supra [1982]). 
Additional means of using retroviruses or retroviral vectors to create transgenic 

20 animals known to the art involve the micro-injection of retroviral particles or 

mitomycin C-treated cells producing retrovirus into the perivitelline space of fertilized 
eggs or early embryos (PCT International Application WO 90/08832 [1990], and 
Haskell and Bowen, Mol. Reprod. Dev., 40:386 [1995]). 

In other embodiments, the transgene is introduced into embryonic stem cells 

25 and the transfected stem cells are utilized to form an embryo. ES cells are obtained 
by culturing pre-implantation embryos in vitro under appropriate conditions (Evans et 
al. Nature 292:154 [1981]; Bradley et al, Nature 309:255 [1984]; Gossler et ai, 
Proc. Acad. Sci. USA 83:9065 [1986]; and Robertson et al, Nature 322:445 [1986]). 
Transgenes can be efficiently introduced into the ES cells by DNA transfection by a 

30 variety of methods known to the art including calcium phosphate co-precipitation, 
protoplast or spheroplast fusion, lipofection and DEAE-dextran-mediated 
transfection. Transgenes may also be introduced into ES cells by retrovirus-mediated 
transduction or by micro-injection. Such transfected ES cells can thereafter colonize 
an embryo following their introduction into the blastocoel of a blastocyst-stage 
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embryo and contribute to the germ line of the resulting chimeric animal (for review, 
See, Jaenisch, Science 240:1468 [1988]). Prior to the introduction of transfected ES 
cells into the blastocoel, the transfected ES cells may be subjected to various selection 
protocols to enrich for ES cells which have integrated the transgene assuming that the 
5 transgene provides a means for such selection. Alternatively, the polymerase chain 
reaction may be used to screen for ES cells that have integrated the transgene. This 
technique obviates the need for growth of the transfected ES cells under appropriate 
selective conditions prior to transfer into the blastocoel. 

In still other embodiments, homologous recombination is utilized to knock-out 
1 0 gene function or create deletion mutants (e.g., truncation mutants). Methods for 
homologous recombination are described in U.S. Pat. No. 5,614,396, incorporated 
herein by reference. 



EXPERIMENTAL 

1 5 The following examples are provided in order to demonstrate and further 

illustrate certain preferred embodiments and aspects of the present invention and are 
not to be construed as limiting the scope thereof. 

In the experimental disclosure which follows, the following abbreviations 
apply: N (normal); M-(molar); mM (millimolar); fiM (micromolar); mol (moles); 

20 mmol (millimoles); \imo\ (micromoles); nmol (nanomoles); pmol (picomoles); g 
(grams); mg (milligrams); jig (micrograms); ng (nanograms); 1 or L (liters); ml 
(milliliters); jil (microliters); cm (centimeters); mm (millimeters); ^m (micrometers); 
nm (nanometers); and°C (degrees Centigrade). 



25 



30 



Example 1 

Establishing and Analyzing a Solid Tumor Cell Xenograft Model 

This examples describes the generation of tumors in mice using human solid 
tumor cells from humans and the analysis of these tumors. 
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Materails and Methods 

Mouse preparation. 8-week old female NOD-SCID mice were anesthetized 
by an intra-peritoneal injection of 0.2ml Ketamine/Xylazine (300mg Ketamine 
combined with 20mg Xylazine in a 4ml volume. 0.02ml of the solution was used per 

5 20g mouse). Dilution to 2004 was done using HBSS. Mice were then treated with 
VP-16 (etoposide) via an intra-peritoneal injection (30mg etoposide dose per 1 kg 
mouse, diluted in serum-free HBSS for a final injection volume of 2004). At the 
same time, estrogen pellets were placed subcutaneously on the back of the mouse's 
neck using a trocar. All tumor injections/implants were done 5 days after this 

10 procedure. In the following procedures, mice were anesthetized as described above. 

Primary tumor specimen implantations. For the implantation of fresh 
specimens, samples of human breast tumors were received within an hour after 
surgery. The tumors were cut up with scissors into small pieces, and the pieces were 
then minced with a blade to yield 2x2 mm-size pieces. Mincing was done in sterile 

15 RPMI 1640 medium supplemented with 20% Fetal Bovine Serum (FBS) under sterile 
conditions on ice. The tumor pieces were washed with serum- free HBSS before 
implantation. A 2-mm incision was then made in the mid abdomen area, and using a 
trocar, one to two small tumor pieces were implanted in the region of the upper right 
and upper left mammary fat pats (right below the second nipple on both sides). A 6-0 

20 suture was wrapped twice around the MFP-Nipple allowing it to hold the implanted 
pieces in place. Sutures were removed after 5 days. Nexaban was used to seal the 
incision and mice were monitored weekly for tumor growth. 

Pleural effusions injections. For the injection of the pleural effusions, cells 
were received shortly after thorocentesis and washed with serum-free HBSS. Cells 

25 were then suspended in serum free-RPMI/Matrigel mixture (1:1 volume) and then 
injected into the upper right and left mammary pads using an 18G needed. 0.2ml 
containing 1-2 million cells were typically injected. The site of the needle injection 
was sealed with Nexaban to prevent any cell leakage. 

Preparation of single cell suspensions of tumor cells. Prior to digestion with 

30 collagenase, Xenograft tumors or primary human tumors were cut up into small 
pieces and then minced completely using sterile blades. To obtain single cell 
suspensions, either pleural effusion cells or the resulting tumor pieces were then 
mixed with ultra-pure Collagenase III in HBSS solution (200-250U Collagenase per 
ml) and allowed to incubate at 37°C for 3-4 hours. Pipetting with a 10ml pipette was 
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done every 15-20 minutes. At the end of the incubation, cells were filtered through a 
45fi nylon mesh and washed with RPMI-20% FBS, then washed twice with HBSS. 
Cells to be injected were then suspended in HBSS/Matrigel mix (1:1 volume) and 
injected into the area of the mammary fat pad as described above. Nexaban was used 
5 to seal the injection site. 

Cell staining for flow-cytometry. Cells were counted and then transferred to 
a 5 ml tube, washed twice with HBSS with 2% Heat-inactivated calf serum (HICS) (5 
min @ 1000 rpm), then re-suspended in lOOfl (per 10 6 cells) of HBSS with 2% HICS. 
5ml of Sandoglobin solution (lmg/ml) was then added and incubated on ice for 10 

10 minutes, after which the sample was washed twice with HBSS 2% HICS and re- 
suspended in 100ml (per 10 6 cells) of HBSS 2% HICS. Antibodies (using appropriate 
dilution per antibody) were then added and incubated for 20 minutes on ice, and then 
washed twice with HBSS 2% HICS. When needed, a secondary antibody addition 
was conducted by re-suspending in lOOul (per 10 6 cells) of HBSS 2% HICS, and then 

1 5 adding 1 -4 ml of secondary antibody (depending on the secondary antibody and its 
concentration), followed by a 20 minute incubation. When a streptavidin step was 
used, cells were re-suspended in lOOul (per 10 6 cells) of HBSS 2% HICS and then lul 
of strepavidin conjugated with the indicated fluorescent dye was added, followed by a 
20 minute incubation. The cells were washed twice with HBSS 2% heat-inactivated 

20 fetal calf serum (HICS) and re-suspended in 0.5ml (per million cells) of HBSS 2% 
HICS that contained 7AAD (1 mg/ml final concentration). 

Flow-cytometry. The antibodies used were anti-CD44 (APC, PE or Biotin), 
anti-CD24 (PE or FITC), anti-B38.1 (APC), anti-ESA-FITC (Biomeda, CA), anti- 
H2K d , (Santa Cruz Products, Santa Cruz, CA). Lineage marker antibodies were anti- 

25 CD2, -CD3 - CD10, -CD16, -CD18, -CD31, -CD64 and -CD140b. Unless noted, 
antibodies were purchased from Pharmingen (San Diego, CA). Antibodies were 
directly conjugated to various fluorochromes depending on the tests. In all tests, 
mouse cells and/or Lineage*** cells were eliminated by discarding H2K d+ (class I 
MHC) cells or Lineage* 1 " cells during flow-cytometry. Dead cells were eliminated 

30 using the viability dye 7-AAD. Flow-cytometry was performed on a FACSVantage 
(Becton Dickinson, San Jose, CA). Side scatter and forward scatter profiles were 
used to eliminate cell doublets. Cells were routinely sorted twice and the cells were 
re-analyzed for purity, which typically was greater than 95%. 
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In solid tumors, it has been demonstrated that only a small proportion of the 
tumor cells are able to form colonies in an in vitro clonogenic assay 21 - 24 » ,0M03 . 
Furthermore, large numbers of cells must typically be transplanted to form tumors in 
xenograft models. One possible explanation for these observations is that every cell 
5 within a tumor has the ability to proliferate and form new tumors but that the 
probability of an individual cell completing the necessary steps in these assays is 
small. An alternative explanation is that only a rare, phenotypically distinct subset of 
cells has the capacity to significantly proliferate and form new tumors, but that cells 
within this subset do so very efficiently 25 . To distinguish between these possibilities 

10 it is necessary to identify the clonogenic cells in these tumors with markers that 

distinguish these cells from other non-tumorigenic cells. This has been accomplished 
in acute myelogenous leukemia (AML), where it was demonstrated that a specific 
subpopulation of leukemia cells (that expressed markers similar to normal 
hematopoietic stem cells) was consistently enriched for clonogenic activity in 

1 5 NOD/SCID immunocompromised mice while other cancer cells were depleted of 
clonogenic activity 104 ' 106 . Such tests have not been reported in solid cancers. 

To investigate the mechanisms of solid tumor heterogeneity, a mouse model 
was developed that was a modification of the NOD/SCID immunodeficient mouse 
model in which human breast cancers were efficiently propagated in the mouse 

20 mammary fat pad In the present application, it was shown that solid tumors 

contain a distinct population of cells with the exclusive ability to form tumors in mice. 
These cells are referred to as tumorigenic cells or cancer initiating cells since they 
consistently formed tumors while other cancer cell populations were depleted of cells 
capable of tumor formation. Cell surface markers were identified which can 

25 distinguish between these cell populations. These findings provide a new model of 
breast tumor biology in which a defined subset of cells drives tumorigenesis, as well 
as generating tumor cell heterogeneity. The prospective identification of this 
tumorigenic population of cancer cells allows for the identification of molecules 
expressed in these cells that can then serve as targets to eliminate this critical 

30 population of cancer cells. 

Tumor specimens and engraftment rate. Human breast cancer specimens 
obtained from primary or metastatic sites in 9 different patients (designated tumors 1- 
9; T1-T9) all engrafted in the NOD/SCID mice. (Table 1). In one case, the cancer 
cells were obtained from a primary breast tumor (T2) while in other cases the cells 
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were obtained from metastatic pleural effusions (Tl, T3-T9). Some tests were 
conducted on cells after they had been passaged once or twice in mice (designated 
Passage 1 & 2) while other tests were conducted on unpassaged fresh or frozen tumor 
samples obtained directly from patients. When using human cancer cells from tumors 
passaged in mice, contaminating mouse cells were removed by eliminating H2K + 
cells [mouse histocompatability class I (MHC)]. 



Table 1 



Tumor 


Origin 


Formation 
In mice 


Passage 
In mice 


Diagnosis 


1 


Metastasis 


Yes 


Yes 


Infiltrating ductal carcinoma 


T2 


Breast 
Primary 


Yes 


Yes 


Adenocarcinoma 


T3 


Metastasis 


Yes 


Yes 


Invasive lobular carcinoma 


T4 


Metastasis 


Yes 


No 


Invasive lobular carcinoma 


T5 


Metastasis 


Yes 


Yes 


Invasive lobular carcinoma 


T6 


Metastasis 


Yes 


Yes 


Inflammatory breast carcinoma 


T7 


Metastasis 


Yes 


Yes 


Invasive lobular carcinoma 


T8 


Metastasis 


Yes 


Yes 


Inflammatory breast carcinoma 


T9 


Metastasis 


Yes 


Yes 


Adenocarcinoma 



Table 1 presented the results of engraftment of human breast cancers into 
NOD/SCBD mice. Mice were injected with unsorted Tl and T3 cells, and a 2 mm 
piece of T2. Cells from T4-T9 were isolated by flow cytometry as described in Figure 
1. All 9 tumors tested engrafted in the NOD/SCID mouse model. Except for T2 
which was a primary breast tumor, all other tumors were metastases. All of the 
tumors were passaged serially in mice except for T4. 

Identification of tumorigenicity markers. Breast cancer cells were 
heterogeneous with respect to expression of a variety of cell surface-markers 
including CD44, CD24, and B38.1. CD24 and CD44 are adhesion molecules, while 
B38.1 has been described as a breast/ovarian cancer-specific marker 9 ' !07 * 108 . To 
determine whether these markers could distinguish tumorigenic from non-tumorigenic 
cells, flow-cytometry was used to isolate cells that were positive or negative for each 
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marker from first passage Tl or T2 cells. When 2xl0 5 -8xl0 5 cells of each population 
were injected, all injections of CD44 + cells (8/8), B38.1* cells (8/8), or CD24' n ° w cells 
(12/12) gave rise to visible tumors within 12 weeks of injection, but none of the 
CD44" cell (0/8), or B38.1" cell (0/8) injections formed detectable tumors (Table 2). 
5 Although no tumors could be detected by palpation in the locations injected with 

CD24* cells, 2 of 12 mice injected with CD24 + cells did contain small growths at the 
injection site that were detected upon necropsy. These growths most likely arose from 
the 1-3% of CD24* cells that invariably contaminate the sorted CD24 + cells, or 
alternatively from CD24 + cells with reduced proliferative capacity (Table 2). Because 

1 0 the CD44 + cells were exclusively B38. 1 + , we focused on the CD44 and CD24 markers 
in subsequent tests. 

Several antigens associated with normal cell types (Lineage markers; CD2, 
CD3, CD10, CD16, CD18, CD31, CD64, and CD140b) were found not to be 
expressed by the cancer cells based on analyses of tumors that had been passaged 

1 5 multiple times in mice. By eliminating Lineage + cells from unpassaged or early 

passage tumor cells, normal human leukocytes, endothelial cells, mesothelial cells and 
fibroblasts were eliminated. By microscopic examination, the Lineage* tumor cells 
had the appearance of neoplastic cells (Figure 6). 
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Table 2 



Tumors/Injections 


Cells/Injection 


8xl0 5 


5x10 s 


2x10 s 


Passsaged Tl 








CD44- 


0/2 


0/2 


- 


CD44+ 


2/2 


2/2 


- 


B38.1- 


0/2 


0/2 


- 


B38.1+ 


2/2 


2/2 




CD24+ 






1/6 


CD24- 






6/6 


Passaged T2 








CD44- 


0/2 


0/2 




CD44+ 


2/2 


2/2 




B38.1- 


0/2 


0/2 




B38.1+ 


2/2 


2/2 




CD24+ 






1/6 


CD24- 






6/6 



Table 2 shows the results of cells isolated by flow cytometry as described in 
figure 1 based upon expression of the indicated marker and assayed for the ability to 
5 form tumors after injection into the mammary fat pads of NOD/SCED mice. For 12 
weeks, mice were examined weekly for tumors by observation and palpation, then all 
mice were necropsied to look for growths at injection sites that were too small to 
palpate. The number of tumors that formed/ the number of injections that were 
performed is indicated for each population. All tumors were readily apparent by 

10 visual inspection and palpation except for tumors from the CD24+ population that 
were only detected upon necropsy. 

Depending on the tumor, 1 1% to 35% of the Lineage' cancer cells in tumors or 
pleural effusions were CD44 + CD24" /low (Figure 4a- If). CD44 + CD24* /!ow Lineage" cells 
or other populations of Lineage* cancer cells that had been isolated from nine patients 

15 were injected into the mammary fat pads of mice (Table 3). When injecting unsorted, 
passaged Tl or T2 cells, 5xl0 4 cells consistently gave rise to tumors, but 10 4 cells 
gave rise to tumors in only a minority of cases. In contrast, as few as 10 3 Tl or T2 
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CD44 + CD24" /l0w Lineage cells gave rise to tumors in all cases (Table 3). In Tl and 
T2, up to 2xl0 4 cells that were CD44 + Lineage" but CD24 + failed to form tumors. 
These data suggest that the CD44 + CD24* /low Lineage' population is 10-50 fold enriched 
for the ability to form tumors in NOD/SCID mice relative to unfractionated tumor 
5 cells. Whether the CD44*CD24" /low Lineage cells were isolated from passaged tumors 
(Tl, T2, T3) or from unpassaged cancer cells obtained directly from patients (Tl, T4- 
T6, T8, T9), they were enriched for tumorigenic activity. Note that T7 was the only 
one of 9 cancers studied that did not fit this pattern (Figure 4f). Other than T7, 
CD24 + Lineage~ cancer cells in both unpassaged and passaged tumors were unable to 
1 0 form new tumors (Table 3). Therefore, the xenograft and unpassaged patient tumors 
were composed of similar populations of phenotypically diverse cancer cell types, and 
in both cases only the CD44 + CD24~ /low Lineage" cells had the capacity to proliferate to 
form new tumors (p<0.001). 



15 Table 3 



# of cells per 
injection 


5xl0 5 


10 s 


5 

xlO 4 


2 
xlO 4 


10 4 


5 

xlO 3 


10 i 


500 


200 


100 


Mouse passage 1 






















Unsorted 


8/8 


8/8 


10/10 




3/12 




0/12 








CD44*CD24 + 








0/10 


0/10 


0/14 


0/10 








CD44*CD24-" 0W 








10/10 


10/10 


14/14 


10/1 
0 








CD44 + CD24- 














10/1 
0* 


4/4 


4/4 


1/6 


CD44 + CD24- 
" 0W ESA- 














0/10 

• 


0/4 


0/4 


0/6 


Mouse passage 2 






















CD44 + CD24 + 










0/9 












CD44*CD24-" 0W 










9/9 
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Patients ' tumor cells 






















CD44 + CD24 + 




0/3 


0/4 


0/8 


1/13 


0/2 










CD44 + CD24"" ow 




3/3 


4/4 




11/13 


1/1 










CD44 + CD24" 
/low ESA + 












2/2 


2/2 








CD44+CD24" 












2/2" 


0/2 









As shown in Table 3, tumorigenic breast cancer cells were highly enriched in 
the ESA+CD44+CD24-/low population. Cells were isolated from first passage 
(designated Mouse Passage 1) Tumor 1, Tumor 2 and Tumor 3, second passage Tumor 
5 3 (designated mouse Passage 2), unpassaged cells obtained from 6 different patients, 
Tl, T4, T5, T6 , T8 and T9, (designated Patients* tumor cells). 
CD44+CD24+Lineage- populations and CD44+CD24-/lowLineage- cells were 
isolated by flow-cytometry as described in figure 1 . The indicated number of cells of 
each phenotype was injected into the breast of NOD/SCID mice. The frequency of 

10 tumorigenic cells calculated by the modified maximum likelihood analysis method is 
-5/1 0 5 if single tumorigenic cells were capable of forming tumors, and every 
transplanted tumorigenic cell gave rise to a tumor 109 . Therefore, this calculation may 
underestimate the frequency of the tumorigenic cells since it does not take into 
account cell-cell interactions and local environment factors that may influence 

15 engraftment. In addition to the markers that are shown, all sorted cells in all tests 

were Lineage-, and the tumorigenic cells from Tl, T2, and T3 were further selected as 
B38.1+. The mice were observed weekly for 4-6 x A months, or until the mice became 
sick from the tumors. #Tumor formation by T5 ESA-CD44+CD24-/lowLINEAGE- 
cells was delayed by 2-4 weeks. *2,000 cells were injected in these tests. 

20 Figure 1 shows isolation of tumorigenic cells. Flow cytometry was used to 

isolate subpopulations of Tumor 1 (a, b), Tumor 3 (c), Tumor 5 (d), Tumor 6 (e) and 
Tumor 7 cells (f) that were tested for tumorigenicity in NOD/SCID mice. Tl (b) and 
T3 (c) had been passaged (P) once in NOD/SCID mice while the rest of the cells were 
frozen or unfrozen samples obtained directly after removal from a patient (UP). Cells 

25 were stained with antibodies against CD44, CD24, Lineage markers, and mouse-H2K 
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(for passaged tumors obtained from mice), and 7 AAD. Dead cells (7 AAD+), mouse 
cells (H2K+) and Lineage* normal cells were eliminated from all analyses. Each plot 
in Figure 1 depicts the CD24 and CD44 staining patterns of live human Lineage- 
cancer cells, and the frequency of the boxed tumori genie cancerpopulation as a 
5 percentage of cancer cells/all cells in each specimen is shown. 

In three of the tumors, further enrichment of tumorigenic activity was possible 
by isolating the ESA* subset of the CD44 + CD24" /low population. ESA (Epithelial 
Specific Antigen, Ep-CAM) has been used in the past to distinguish epithelial cancer 
cells from benign reactive mesothelial cells 110 . When ESA + CD44 + CD24' /low Lineage" 

10 cells were isolated from passaged Tl, as few as 200 cells consistently formed tumors 
of approximately 1 cm between 5-6 months after injection whereas 2000 ESA" 
CD44 + CD24' /low Lineage cells or 20,000 CD44 + CD24 + cells always failed to form 
tumors (Table 3). Ten thousand unsorted cells formed tumors in only 3 of 12 mice. 
This suggests that the ESA + CD44 + CD24" /low Lineage* population was more than 50 

1 5 fold enriched for the ability to form tumors relative to unfractionated tumor cells 

(Table 3). The ESA + CD44 + CD24 _/l0W Lineage population accounted for 2-4% of first 
passage Tl cells (2.5-5% of cancer cells). The ESA + CD44 + CD24 _/,ow Lineage 
population (0.6% of cancer cells) from unpassaged T5 cells was also enriched for 
tumorigenic activity compared to ESA'CD44 + CD24" /low Lineage" cells, but both the 

20 ESA + and ESA" fractions had some tumorigenic activity (Table 3). Among 

unpassaged T5 cells, as few as 1000 ESA + CD44 + CD24" /,ow Lineage~ cells consistently 
formed tumors. 

In order to determine whether the difference in tumorigenicity of the cell 
populations was due to differences in cell cycle, populations were analyzed by flow- 
25 cytometry. Comparison of the cell cycle status of tumorigenic and non-tumorigenic 
cancer cells from Tl revealed that both exhibited a similar cell cycle distribution (Fig. 
2a, 2b). Therefore, neither population was enriched for cells at a particular stage of 
the cell-cycle, and the non-tumorigenic cells were able to undergo at least a limited 
number of divisions in the xenograft model. 
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Figure 2 shows the DNA content of tumorigenic and non-tumorigenic breast 
cancer cells. The cell cycle status of the ESA+CD44+CD24-/lowLineage- 
tumorigenic cells (a) and the remaining Lineage- non-tumorigenic cancer cells (b) 
isolated from Tl were determined by hoechst 33342 staining of DNA content (20). 
5 The tumorigenic and non-tumorigenic cell populations exhibited similar cell cycle 
distributions 

Six months after inoculation, the injection sites of 20,000 tumorigenic 
CD44 + CD24~ /IOW Lineage~ cells and 20,000 CD44 + CD24 + Lineage" cells were examined 
by histology. The CD44 + CD24" /low Lineage* injection sites contained tumors 

10 approximately 1 cm in diameter while the CD44 + CD24 + Lineage~ injection sites 

contained no detectable tumors (Figure 6c). Only normal mouse mammary tissue was 
seen by histology at the sites of the CD44 + CD24 + Lineage~ injections (Figure 3a), 
whereas the tumors formed the CD44 + CD24 vlow Lineage" cells contained malignant 
cells as judged by hematoxylin and eosin stained sections (Figure 3b). Even when 

15 CD44 + CD24 + Lineage" injection sites from 58 mice, each administered 1,000-50,000 
cells, were examined after 16-29 weeks, no tumors were detected. Furthermore, the 
tumorigenic and non-tumorigenic populations were indistinguishable 
morphologically. Both the tumorigenic and non-tumorigenic subsets of Lineage- cells 
from passaged and unpassaged tumors contained >95% cancer cells as judged by 

20 Wright staining or Papanicolaou staining and microscopic analysis. By histology, the 
CD44 + CD24~ /low Lineage" cells and the rest of the Lineage- cells had the appearances 
of epithelial cancer cells (Figure 3d, 3e). 

Figure 3 shows histology from the CD24+ injection site (a), (20x objective 

magnification) revealed only normal mouse tissue while the CD24"^ 0W injection site 

25 (b), (40x objective magnification) contained malignant cells, (c) A representative 
tumor in a mouse at the CD44 + CD24 v,0W Lineage' injection site, but not at the 
CD44 + CD24 + Lineage* injection site. T3 cells were stained with Papanicolaou stain 
and examined microscopically (lOOx objective). Both the non-tumorigenic (c) and 
tumorigenic (d) populations contained cells with a neoplastic appearance, with large 

30 nuclei and prominent nucleoli. 

The tumorigenic population is capable of generating the phenotypic 
heterogeneity found in the initial tumor. The ability of small numbers of 
CD44 + CD24* /low Lineage" tumorigenic cells to give rise to new tumors was reminiscent 
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of the organogenic capacity of normal stem cells. Normal stem cells self-renew and 
give rise to phenotypically diverse cells with reduced proliferative potential. To test 
whether tumorigenic breast cancer cells also exhibit these properties; tumors arising 
from 200 ESA + CD44 + CD24 /,0W Lineage Tl or 1,000 CD44 + CD24- /low Lineage T2 
5 cells were dissociated and analyzed by flow-cytometry. The heterogeneous expression 
patterns of ESA, CD44 or CD24 in the secondary tumors resembled the phenotypic 
complexity of the tumors from which they were derived (Figure 7a, 7b vs 7e,7f). 
Within these secondary tumors, the CD44 + CD24 vl0W Lineage" cells remained 
tumorigenic, while other populations of Lineage" cancer cells remained non- 
10 tumorigenic (Table 3). Thus tumorigenic cells gave rise to both additional 

CD44 + CD24" /low Lineage" tumorigenic cells as well as to phenotypically diverse non- 
tumorigenic cells that recapitulated the complexity of the primary tumors from which 
the tumorigenic cells had been derived. These CD44 + CD24* /low Lineage" tumorigenic 
cells from Tl, T2 and T3 have now been serially passaged through four rounds of 
15 tumor formation in mice, yielding similar results in each passage with no evidence of 
decreased tumorigeneity. These observations suggest that CD44 + CD24" /low Lineage" 
tumorigenic cancer cells undergo processes analogous to the self-renewal and 
differentiation of normal stem cells. 

Figure 4 shows the phenotypic diversity in tumors arising from CD44+CD24- 
20 /lowLineage- cells. The plots depict the CD24 and CD44 or ESA staining patterns of 
live human Lineage- cancer cells from Tumor 1 (a, c and e) or Tumor 2 (b, d and f). 
Tl CD44+Lineage- cells (a) or T2 Lineage- cells (b) were obtained from tumors that 
had been passaged once in NOD/SCID mice. ES A+CD44+CD24-/lowLineage- 
tumorigenic cells from Tl (c) or CD44+CD24-/lowLineage- tumorigenic cells from 
25 T2 (d) were isolated and injected into the breasts of NOD/SCID mice. Panels (e) and 
(f) depict analyses of the tumors that arose from these cells. In both cases, the 
tumorigenic cells formed tumors that contained phenotypically diverse cells similar to 
those observed in the original tumor. 

Expression of Wnt pathway genes in subpopulations of breast cancer 
30 tumor cells. The Frizzled proteins are receptors for the growth/survival factors of the 
Wnt family. In some normal stem cells, Wnt is known to play a role in proliferation, 
survival and differentiation. In certain situations, stimulation of Wnt can promote 
stem cell self-renewal. Upon activation, Wnt induces the stabilization of p-catenin. 
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Flow cytometry using an antibody against P-catenin demonstrates that Tumor 1 cells 
express this protein (Figure 5). Immunohistochemistry shows that the P-catenin is 
located in the cytoplasm and the nucleus, indicating that the protein is active (data not 
shown). Different Wnt proteins specifically activate different frizzled receptors (44). 

5 Since the Wnt signaling pathway appears to play a critical role in proliferation of both 
normal and breast cancer cell proliferation (14,27), the expression of Wnt pathway 
genes in Tumor 1 tumorigenic cells and non-tumorigenic cells was examined (Figure 
5). To do this, one hundred ESA + B38.1 + CD24* /|0 LINEAGE" (tumorigenic) or non- 
tumorigenic tumor cells were isolated. RT-PCR using nested primers for each of the 

10 frizzled proteins was done. These results demonstrate that the tumorigenic cells 

expressed frizzled 2 and 6, while the non-tumorigenic cells expressed frizzled 2 and 7 
(Figure 5). These tests have been repeated twice with identical results. Next, 
members of the Wnt family expressed by the breast cancer cells were identified. 
RNA was isolated from 10,000 stem and non-tumorigenic cells. There are more than 

15 20 known members of the Wnt family, making it difficult to analyze expression of 
particular Wnts in breast cancer tumors. Therefore RT-PCR was performed using 
degenerate primers that recognize all known Wnt genes and cloned and sequenced the 
resultant cDNA. Surprisingly, we were able to detect expression of cDNA only by 
the non-tumorigenic cells (Figure 5). This was confirmed doing RT-PCR at the ten- 

20 cell level. Frizzled 6 expression was detected in nine of ten tumorigenic samples, and 
only one of ten non-tumorigenic cell samples. The cDNA was cloned, and 
sequencing revealed that these cells expressed Wnt 3 A, 4, 7 A, 7B, 10B, and 11. Wnt 
signals have been implicated in the growth of both breast cancer cells and normal 
endothelial cells. While not necessary to understand to practice the present invention, 

25 this suggests that the non-tumorigenic cells promote tumor formation both by 

stimulation of breast cancer stem cells and vessel formation via the Wnt pathway. 
This model fits very well with known observations that it is much easier to grow 
breast cancers using pieces of tissue as opposed to individual cells (22). 

Figure 5 shows the expression of Wnt (left panel) and Frizzled (right panel). 

30 In regard to the left panel, RT-PCR was done using degenerate Wnt primers with 
RNA isolated from 10,000 cells of the indicated type. + or - indicates whether RT 
was used. Right panel. RNA was isolated from one hundred breast cancer cells or 
breast cancer stem cells isolated by flow cytometry as described in figure 1 . RT-PCR 

99 

20O500S6O1A2 I > 



wo 



2005/005601 



PCT/US2004/018266 



was done using nested primers to detect the indicated mRNA. Control RT-PCR 
reactions omitting RT were negative. 

To confirm the RT-PCR results for the expression of frizzled proteins, an 
Affymetrix microarray was probed with cDNA made from Tumor 1, Tumor 2 and 
5 Tumor 3 cancer stem cells. All three tumors expressed Frizzled 2 & 6. In addition, 
Tumors 2 & 3 appeared to express frizzled 4. 

Isolation of normal cells from a tumor. Efforts were then made to determine 
whether sufficient normal cells could be isolated from a tumor to do molecular studies 
with these cells. Normal fibroblast and endothelial cells from a patient's tumor 

1 0 (approximately 3cm in size) were isolated by flow cytometry. 2% of the tumor cells 
were CD31 + endothelial cells and 8% were CD140b + fibroblasts (Figure 6). Nine 
thousand fibroblasts and two thousand endothelial cells were collected when 1/45 of 
the tumor was used for flow cytometry. By extrapolation, it would have been possible 
to isolate approximately 90,000 endothelial cells and 405,000 fibroblasts from the 

1 5 entire tumor. 

Figure 6 shows the isolation of normal tumor fibroblasts and endothelial cells. 
Tumors were dissociated as described in the methods section and tumor cells were 
stained with cytochrome labeled with antibodies against -CD2, -CD3, -CD16, -CD18 
,-CD45, -CD64, and anti-B38.1-APC (to eliminate hematopoietic cells and tumor 

20 cells respectively), anti-CD 1 40b-PE and anti-CD3 1 -FITC. A: the box shows the 
sorting gate for fibroblasts, which are Lineage" CD3T CD140b + cells. B: the box 
shows the sorting gate for endothelial cells, which are CD31 + Lineage" cells. 

Infection of breast cancer stem cells with an adenovirus vector. Since the 
xenograft tumors can only be grown briefly in tissue culture, conventional 

25 transfection methods are generally not useful for gene expression studies and only 
viral vectors have the potential to efficiently transduce the breast cancer stem cells. 
Therefore, the ability of adenovirus vectors to infect Tl breast cancer stem cells was 
tested. To do this, groups of 10,000 breast cancer stem cells or control MCF-7 cells 
were infected with 0, 50, 500, or 5,000 LacZ adenovirus particles. Figure 7 shows 

30 that we could easily transduce greater than 90% of the stem cells and they were more 
easily infected with the adenovirus vector than were the control MCF-7 cells. This 
demonstrates that we can use adenovirus vectors to transduce the stem cells with 
recombinant genes. 
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Figures 7 shows infection of breast cancer stem cells with an adenovirus 
vector. Flow cytometry was used to isolate CD44 + CD24* /low Lineage* cells. The 
Tumor 1 stem cells or control MCF-7 cells were infected with 0, or 500, or 5,000 
LacZ adenovirus particle/cell. Two days later, the cells were stained with X-gal. 
5 Note that the Tumor 1 stem cells were easily infected by the adenovirus vector. 

The following data is a description of work that has been done studying 
hematopoietic stem cells. It illustrates fundamental stem cell properties, and it also 
demonstrates how the isolation of stem cells enables one to first characterize these 
cells and then to do molecular and biochemical studies to functionally characterize 
10 them. 

Adult stem cell numbers are strictly regulated. The regulation of 
hematopoietic stem cell (HSC) homeostasis is not well understood. We screened for 
genetic polymorphisms that were linked to differences between mouse strains in the 
numbers of long-term reconstituting HSCs or restricted progenitors in the bone 

1 5 marrow. AKR/J mice had significantly higher frequencies and numbers of both HSCs 
and restricted progenitors in their bone marrow than C57BL/Ka-Thy- 1 . 1 mice. The 
C57BL/Ka-Thy-l.l alleles were partially dominant. A lccus on chromosome 17, 
including the H-2 complex, was significantly linked to the frequency of long-term 
self-renewing HSCs but showed no evidence of linkage to the frequency of restricted 

20 progenitors. Conversely, a chromosome 1 locus exhibited suggestive linkage to 
restricted progenitor frequencies but was not linked to HSC frequency. This 
demonstrates that there are distinct genetic determinants of the frequencies of HSCs 
and restricted progenitors in vivo. The AKR/J chromosome 17 locus was not 
sufficient to increase HSC frequencies when bred onto a C57BL background. This 

25 suggests that to affect HSC frequencies, the product(s) of this locus likely depend on 
interactions with unlinked modifying loci. The present invention demonstrates that 
stem cell expansion is under tight genetic regulation in an animal. 

Genomic analysis of hematopoietic stem cells. Hematopoietic stem cells 
(HSCs) have self-renewal capacity and multilineage developmental potentials. The 

30 molecular mechanisms that control the self-renewal of HSCs are still largely 
unknown. A systematic approach using bioinformatics and array hybridization 
techniques to analyze gene expression profiles in HSCs was done. To enrich mRNAs 
predominantly expressed in uncommitted cell lineages, 54 000 cDNA clones 
generated from a highly enriched population of HSCs and a mixed population of stem 
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and early multipotent progenitor (MPP) cells were arrayed on nylon membranes 
(macroarray or high-density array), and subtracted with cDNA probes derived from 
mature lineage cells including spleen, thymus, and bone marrow. Five thousand 
cDNA clones with very low hybridization signals were selected for sequencing and 
5 further analysis using microarrays on glass slides. Two populations of cells, HSCs 
and MPP cells, were compared for differential gene expression using microarray 
analysis. HSCs have the ability to self-renew, while MPP cells have lost the capacity 
for self-renewal. A large number of genes that were differentially expressed by 
enriched populations of HSCs and MPP cells were identified. These included 

1 0 transcription factors, signaling molecules, and previously unknown genes. 

Bmi-1 is required for HSC self-renewal. The gene expression analysis of 
HSCs allowed us to identify genes potentially important for self-renewal. After 
analysis of the gene expression data, we began mechanistic studies to identify 
important stem cell regulatory genes. A central issue in stem cell biology is to 

1 5 understand the mechanisms that regulate self-renewal of HSCs, which is required for 
hematopoiesis to persist for the lifetime of the animal. We found that adult and El 4. 5 
fetal mouse and adult human hematopoietic stem cells express the proto-oncogene 
bmi-1. The number of fetal liver HSCs, as measured by flow cytometry, was normal 
in loss of function bmi-1 mice, and the bmi-l^'HSCs were able to migrate normally 

20 towards a chemokine gradient. In post-natal bmi-V 1 ' mice, the number of HSCs, but 
not early progenitor cells was markedly reduced. Both fetal liver and bone marrow 
cells obtained from bmi-V 1 ' mice were able to contribute only transiently to 
hematopoiesis when transplanted into lethally irradiated recipients. There was no 
detectable self-renewal of adult hematopoietic stem cells, indicating a cell 

25 autonomous defect in bmi-V 1 ' mice. This study indicates that expression of bmi-1 is 
essential for the generation of self-renewing adult hematopoietic stem cells. See the 
manuscript by Park et aL, "Bmi-1 is required for maintenance of adult self-renewing 
hematopoietic stem cells" Nature (2003). 

Summary: The xenograft model developed by this laboratory has made 

30 possible the analysis of human breast cancer cells at the cellular level. Although 

cancer cell lines have proven useful for many studies, the cell lines are adapted to the 
unique conditions imposed by tissue culture and many of their properties clearly differ 
from the cancer cells in patients' tumors 91,109 . Recently, the size of primary breast 
cancer tumors prior to resection has markedly decreased. This has made biological 
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and biochemical studies using patient samples difficult. It is contemplated that the 
xenograft model described in the preliminary results ameliorates this problem. 
Preliminary results suggest that the xenograft tumors appear to recapitulate the 
phenotypic and biological diversity seen in the original patients' tumors. Although 
5 there may be some differences in the mouse and human tumors due to environmental 
factors, the NOD/SCID model described here is the best available model of human 
breast cancer. Results demonstrate that breast cancer cells reliably engraft in this 
xenograft model and in the early passages reflect the cellular and biological diversity 
found in the original human tumor. These tests also show that different populations of 
1 0 cancer cells may differ in their ability to form tumors. 

Example 2 

Characterizing the Wnt/p-catenin Pathway in Human Breast Cancer Tumors 

This examples describes how one could characterize the Wnt/p-catenin 

1 5 pathway in human breast cancer tumors using the xenograft model described above. 
The Wnt/p-catenin pathway plays a role in the proliferation and self-renewal of 
normal stem cells. Although a significant percentage of human breast cancers appear 
to have constitutive activation of this critical pathway, unlike colon cancer, it has not 
been definitively established what role this pathway plays in the pathology of this 

20 disease in humans 84 * 89 . The xenograft model described above may be used to 

characterize the biological consequences of this pathway in human breast cancer 
tumors. These tests are done using cancer cells directly after removal from patients 
and early passage xenograft tumors. 

The function of the Wnt/frizzled/B-catenin signaling pathway in multiple 

25 patients' tumors. Rationale: Almost 90% of colon cancers contain mutations that 
result in activation of p-catenin. The most common mutations are in the APC gene, 
which is involved in targeting P-catenin for degradations or mutations in the P- 
catenin protein itself 43,1 1 1 . These latter mutations prevent degradation. Although the 
cancer cells in many breast tumors appear to have constitutively acitve of P-catenin 84 , 

30 in contrast to colon cancer, mutations in the APC gene or p-catenin itself account for 
only 6-10% of these cases 84 * 89 . Examination of the Wnt/p-catenin signaling pathway 
in breast cancer cells should lead to new insights into the pathogenesis of this disease. 
There are a large number of Wnt proteins that are thought to differentially bind to 
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different Frizzled receptors 43 ' 65,69 ' 78,112 . Only a subset of Writs, and by inference 
Frizzled receptors, can activate P-catenin. Normally, p-catenin is bound to E-cadherin 
at the cell membrane. Cytoplasmic P-catenin forms a complex with the APC and Axin 
proteins and facilitates p-catenin phosphorylation by GSK3p 87,88,1 1 ! . The 
5 phosphorylated P-catenin is then degraded via the ubiquitin degradation pathway. 
However, upon activation of Sizzled receptors by a Wnt, p-catenin is stabilized. The 
protein then translocates to the nucleus where it forms a complex with the 
LGLS/BCL9, PYGO and TCF proteins to activate transcription 1 13,1 14 . We believe that 
our xenograft model and cellular assays are unique and powerful tools for 

10 understanding this critical pathway. We analyze 10 tumors that have constitutive P- 
catenin signaling and 10 that do not. These studies give new insights into the 
mechanisms by which the Wnt pathway is activated and the consequences of this 
activation in human breast cancer. 

In mice, ectopic expression of various Wnt proteins results in breast tumor 

15 formation, while in humans activated P-catenin in breast cancer cells is associated 
with expression of cyclin Dl and poor prognosis 78,84,1 15,1 16 . However, it is not known 
whether continuous P-catenin signaling is necessary for tumorigenic breast cancer 
cells to form tumors. There are several possible roles that constitutive P-catenin 
signaling can play in human breast cancer. First, it can be necessary for continued 

20 proliferation and/or viability of the tumorigenic cancer cells. Next, it can be necessary 
for the initiation of the tumor, but subsequent mutations bypass the need for p-catenin 
signaling. Third, it may make the cancer cells more resistant to chemotherapy due to 
the activation of downstream targets such as cyclin Dl. Fourth, constitutive P-catenin 
signaling accelerates cancer cell growth, but is not necessary for tumorigenicity. 

25 Finally, the role of P-catenin signaling in tumor formation might differ in tumors with 
and without constitutive activation of P-catenin. For example, the former tumors 
might require p-catenin signaling whereas the latter tumors might require Wnt signals 
from other tumor cells or they might be independent of p-catenin because they have 
constitutive activation of downstream targets such as c-myc and/or cyclin Dl. The 

30 tciti described hero are designed to distinguish between these possibilities using a 

novel xenograft model of human cancer. The data ehows that the xenograft model 

virtually recapitulates a human breast tumor. Thus, this model allows us to study the 

Wnt pathway in de novo human tumors in as physiological conditions as possible. 



WO 2005/005601 PCT/US2004/0 18266 

Is P-catenin signaling required for tumor formation by cancer cells 
isolated from multiple pateints? The tests here determine whether the p-catenin 
pathway is obligate for breast cancer cell growth or whether activation is not required 
for tumor formation but does increase the rate of proliferation of the cancer cells. 

5 Although the xenograft tumors appear to closely resemble human tumors, over time 
selection pressure result in tumors that are adapted to the mouse environment. The 
cancer cells in such tumors differ in some ways with the cancer cells that made up the 
original human tumors. We identify cancer cells from five different xenograft tumors 
and five unpassaged tumors that have activated p-catenin (cytoplasmic and/or nuclear 

10 expression by immunohistochemistry) and cancer cells from five xenograft tumors 
and five unpassaged tumors that do not (membrane-associated expression by 
immunohistochemistry). We select tumors that are heterogeneous for important 
prognostic features that include estrogen receptor/progesterone receptor (ER/PgR), 
primary tumor vs. metastatic tumor, wild type vs. mutant p53, and amplification of 

15 Her2/neu. 

To identify cells that have constitutive activation of p-catenin, we take 
advantage of the observation that this results in stabilization of P-catenin and 
accumulation of the protein in the cytoplasm and nucleus. When not activated, P- 
catenin is associated with the plasma membrane. We therefore analyze the breast 

20 cancer cell population from each of the tumors using immunohistochemistry to 
determine the sub-cellular localization of P-catenin and using flow cytometry to 
determine the amount of P-catenin expressed by each population of cells. To do this, 
we use flow cytometry to isolate the Lineage" cancer cells from multiple tumors. 
Viably frozen xenograft or patient tumor cells are used for this analysis. The cancer 

25 cells then are stained with an anti-P-catenin-FITC antibody for immunohistochemistry 
and flow cytometry analysis using the antibody manufacturer's protocol 
(Transduction Laboratories). Cells with activated P-catenin have cytoplasmic/ nuclear 
localization and increased levels of the protein. 

To determine the role of P-catenin signaling in tumorigenesis, Lineage" cancer 

30 cells isolated from each of the 20 tumors are infected with either an adenovirus vector 
or a lentivirus vector that contains a dominant-negative (dn) TCF4-IRES-GFP 
minigene or a control GFP virus (for details of virus construction and use, see 117 ). The 
adenovirus vector express the dnTCF4 transiently for 1-3 weeks, while the lentivirus 
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vector express the dnTCF4 permanently. The dnTCF4 adenovirus has already been 
made using a dnTCF4 minigene (a gift from Eric Fearon). The dnTCF4 forms a 
complex with p-catenin thereby inhibiting transcriptional transactivation by the 
activated p-catenin. Note that the dnTCF4 blocks signaling from all members of the 
5 TCF family that mediate P-catenin signaling (Eric Fearon, personal communication). 
Limiting dilution tests are done to determine the ability of the transduced cells to form 
colonies in vitro and tumors in vivo. The tests here are done using cancer cells isolated 
from either patient or human tumors by flow-cytometry. By eliminating the lineage 
cocktail to eliminate the normal cells, colony formation in tissue culture and tumor 

1 0 formation in mice by cancer cells can be measured (The possible contributions of 
normal stromal cells to the growth of tumorigenic cells are analyzed as described 
below in aim 2B). To determine the role of P-catenin signaling on cancer cell growth 
and viability, five sets of 1,000, 5, 000, 20,000, 50,000 and 100,000 Lineage" cancer 
cells from each of the tumors infected with the dnTCF4 viruses (either the adenovirus 

1 5 or lentivirus vectors) and control viruses are cultured in vitro in medium containing 
the Notch ligand Delta and the number of colonies that form are determined. The 
colonies in a control tissue culture plate are stained with cytokeratin to confirm that 
they arose from neoplastic cells 96 . Two days after infection the cells are examined 
with a fluorescent microscope to confirm that greater than 90% of the cells were 

20 transduced by the virus. Similarly, in vivo limiting dilution tests are done to determine 
whether the dnTCF4 viruses affect tumor formation by the cancer cells isolated from 
the different patients. After infection, ten sets of 5, 000, 20,000, 50,000 and 100,000 
Lineage* cancer cells are isolated by flow-cytometry and then infected with the 
dnTCF4 adenovirus or control adenovirus. The infected cells are injected into the 

25 breast of NOD/SCID mice. We then determine the number of cancer cells needed to 
form tumors in each group, the time needed to form tumors in each group, the rate of 
growth of each group, and the size of the tumors that form in each group. This allow 
us to determine whether P-catenin is necessary for tumor formation by cancer cells 
that do or do not have constitutively activated P-catenin. 

30 Subsequent tests depend on the results of the in vivo and in vitro limiting 

dilution tests. If inhibition of P-catenin transcriptional transactivation blocks tumor 
formation or slows tumor growth, then we begin to test whether downstream p- 
catenin targets such as cyclin Dl or c-myc are required for tumorigenicity 84,85,118 . To 
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do this, we infect the Lineage- cancer cells isolated by flow-cytometry and infect the 
cells with either the control or dnTCF4 adenovirus as well as a control gfp vector, a c- 
myc-IRES-gfp retrovirus vector, a cyclin Dl-IRES-rlp retrovirus vector, or both the 
myc-IRES-gfp and the cyclin Dl-IRES-rfp retrovirus vectors. Infected cells are 

5 isolated by flow cytometry, and then ten sets of 5,000, 1 0,000, 20,000, 50,000 or 

100,000 Lineage- cancer cells of each test group are injected into mice. The mice are 
analyzed weekly for the formation of tumors, and the rate of growth of each test 
group. This allows us to determine whether enforced expression of either c-myc 
and/or cyclin rescues the cells from inhibition of P-catenin signaling. 

10 If inhibition of P-catenin does not have any discernable effects on tumor 

formation, we first confirm that both of the dominant-negative viruses are inhibiting 
expression of the dnTCF4 minigene. If not, we use another method to inhibit the p- 
catenin pathway. In addition to RNA-i and antisense approaches 119 " 122 , overexpression 
of Axin (which targets p-catenin for degradation) can be used to inhibit p-catenin 66,75 . 

15 If p-catenin signaling was inhibited and there was minimal or no effect on tumor 
formation, then we determine whether there are more subtle changes on the cancer 
stem cells. Expression of cyclin Dl, whose expression is induced by P-catenin, has 
been associated with resistance to chemotherapy. Therefore, we treat mice with 
Adriamycin (8mg/kg) or Taxol (60mg/kg) five days after the dnTCF4-transduced or 

20 control cancer stem cells were injected into mice to determine whether inhibition of p- 
catenin enhance the efficacy of chemotherapy. The effect on tumor formation and 
tumor growth rate is determined as described above. 

Expected results. Although cancer cells in a significant number of breast 
tumors have a constitutively active P-catenin signaling pathway, it is not known 

25 whether this pathway is essential for malignant transformation. If the Wnt/p-catenin 
pathway is necessary for the cancer cells to form tumors, then dominant-negative 
inhibitors block the ability of cancer cells to form tumors. If constitutive p-catenin 
signaling enhances tumor cell growth after malignant transformation but is not 
necessary for tumor formation, then the dominant-negative inhibitor slow growth of 

30 the tumor cells but not block tumor formation. If oncogenic mutations subsequent to 
tumor initiation make the cells independent of Wnt signaling, then the dominant- 
negative inhibitor do not affect tumor formation or growth. Finally, it is possible that 
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constitutive activation of the Wnt pathway contributes to resistance to apoptosis and 
therefore makes the cells resistant to chemotherapy. 

In a model of mouse cancer, a brief inhibition of c-ras or c-myc activity in 
cancer cells transformed by these genes resulted in a permanent loss of 
5 tumorigenicity 123,124 . If this is also true for p-catenin signaling, then transient 

inhibition of signaling by the adenovirus inhibit tumor formation. If inhibition of p- 
catenin signaling inhibits tumorigenicity, but the cells remain viable and restoration of 
p-catenin signaling enables them to form tumors, then the adenovirus vector slow 
tumor formation wherease the lentivirus vector inhibit tumor formation. If P-catenin 

10 signaling increases the rate of proliferation but is not obligate for tumorigenicity, then 
both viral vectors delay tumor formation and slow the growth of the tumors. If some 
tumors rely on p-catenin signaling and others rely on other pathways or have 
constitutive activation of downstream effectors of P-catenin signaling, then some 
tumors are affected by the viral vectors while others do not. The tests described in this 

1 5 aim allow us to answer these critical questions using a unique model recapitulates 
human tumors. These tests for the first time delineate the biological function(s) of p- 
catenin signaling in de novo human breast cancers. 

The lentivirus can be made using other envelopes until one is found that 
infects the cells efficiently 125 ' 126 . 

20 Note that with the lentivirus vector, infection efficiency may only be in the 

range of 30-70%. This would mean that a significant number of tumor cells would 
remain that could form tumors. However if inhibition of P-catenin signaling inhibits 
tumor formation, then the resultant tumors would not express gfp. Flow cytometry is 
used to measure gfp-expressing cells in the tumors infected with the dnTCF4 and 

25 control viruses. The tumors arising from the dnTCF4 group have a marked decrease in 
such cells if p-catenin signaling does play a role in tumor formation. 

Does inhibition of p-catenin signaling alter the phenotype of tumorigenic 
breast cancer cells? One of the informative markers useful for the separation of 
tumrorigenic and non-tumorigenic breast cancer cells is CD44. Interestingly, CD44 is 

30 one of the target genes that is transcriptionally upregulated by P-catenin and epithelial 
stem cells, but not their differentiated progeny, are felt to express this marker 83,127 . 
We contemplate that inhibition of p-catenin signaling result in the differentiation of 
the tumorigenic breast cancer cells and cause them to lose expression of CD44. We 
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further contemplate that the CD44* non-tumorigenic cancer cells do not have active P- 
catenin. To test this, we use flow-cytometry to isolate ESA + CD44 + CD24" /low Lineage- 
cancer cells from Tumor 1, Tumor 2 and Tumor 3 and infect them with the dnTCF4 
adenovirus or a control adenovirus. The cells are cultured in tissue culture medium 
5 containing soluble Delta. We have found that this medium allows the tumorigenic 
cells to grow in tissue culture for 1-3 weeks. The cells are monitored for growth in 
vitro over a 3-week period. In addition, 1, 3 and 7 days after infection, the dnTCF4 
adenovirus or a control adenovirus infected cells are analyzed by flow-cytometry for 
the expression of ESA, CD44 and CD24. 

10 Next, we determine if there is a difference in P-catenin signaling in the 

tumorigenic cancer cells, the CD44 + cancer cells, or the CD44* cancer cells. To do 
this, we use flow-cytometry to isolate ESA + CD44 + CD24" /low Lineage- tumorigenic 
cancer cells, CD44 + cancer cells, and CD44" non-tumorigenic cancer cells from tumor 
1, tumor 2 and tumor 3. Each population of cells are stained with an anti- P-catenin 

15 antibody that has been conjugated with APC. Each population of cells are analyzed by 
fluorescent microscopy to determine whether the p-catenin is membrane bound (not 
constitutively active), and by flow-cytometry to determine the amount of the protein 
in the cells. The level of p-catenin is associated with activity. In addition, we use 
commercially available antibodies that recognize phosphorylated and 

20 unphosphorylated P-catenin. The phosphorylated form is marked for degradation 
while the unphosphorylated form is active 128,129 . These tests allow us to determine 
whether CD44 expression and P-catenin signaling are linked in patients' cancer cells. 

CD44 is one of the best markers that allows one to distinguish tumorigenic 
cancer cells from non-tumorigenic cancer cells. Since CD44 is transcriptionally 

25 activated by P-catenin, then inhibition of P-catenin signaling result in downregulation 
ofCD44. 

Does the differential expression o f the fril led proteins affect breast 
cancer stem cell fate in Tumor 1? Data suggest that in Tumor 1, the tumorigenic 
stem cells express frizzled 2 and 6, whereas the non-tumorigenic neoplastic cells 
30 express Wnt 3, 4, 7A, 7B, 1 0B, and 1 1. This suggests a paracrine system in this 

particular tumor where the non-tumorigenic cells might drive the proliferation of the 
cancer stem cells. Preliminary data also suggest that in Tumor 1, the tumorigenic stem 
cells express frizzled 2 and 6, whereas the non-tumorigenic neoplastic cells express 
frizzled 2 and 7. It is possible that differential expression of frizzled genes plays a role 
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in cancer cell fate decisions. The other possibility is that differential expression of 
these genes is a function of differentiation or immortality but does not directly 
regulate cell fate decisions in this tumor. This sub-aim distinguish between these 
possibilities. 

5 Tumor 1 tumorigenic cells express frizzled 6 and non-tumorigenic cancer cells 

express frizzled 7. It is possible that frizzled 6 enhances and frizzled 7 inhibits the 
proliferation or self-renewal of the cancer cells.lTo test this possibility, in vitro and in 
vivo clonogenic assays are done. Tumor 1 tumorigenic and non-tumorigenic cancer 
cells are infected with a lentivirus vector that expresses either frizzled 6-IRES-GFP or 

10 frizzled 7-IRES-GFP. A lentivirus is used rather than an adenovirus since the former 
virus can infect and stably transduce a high proportion of primary cells, whereas 
adenovirus transduction is often transient. It is conceivable that expression of frizzled 
6 confers the ability to self renew to the cancer cells. If so, infection of stem cells 
and/or non-tumorigenic cells with a lentivirus vector containing a frizzled 6-IRES- 

1 5 GFP minigene may enhance tumorigenicity of the stem cell or allow the previously 
non-tumorigenic cells to form tumors. Conversely, enforced expression of frizzled 7 
may inhibit tumorigenicity. After infection with either the frizzled or control virus, 
limiting dilution tests are done to determine whether enforced expression of each gene 
alters the ability of each population of cancer cells to form tumors. 

20 These tests allow us to determine whether enforced expression of frizzled 6 

increases stem cell proliferation and/or self-renewal or expression of frizzled 7 
inhibits tumorigenic cancer cell proliferation and/or self-renewal. To test this 
possibility, we isolate the tumorigenic ES A + CD44 + CD24* /low Lineage" cancer cells and 
the other Lineage", non-tumorigenic cancer cells are isolated by flow-cytometiy from 

25 each of the tumors. First, immunohistochemistry are done using the anti-P-catenin 
antibody to determine whether there is a difference in the amount of active P-catenin 
in the tumorigenic and non-tumorigenic cells. Next, we determine the amount of 
phosphorylated (inactivated) and non-phosphorylated (active) p-catenin the 
tumorigenic and non-tumorigenic cells 128 . 

30 Next, in vitro assays are designed to determine the affects of each gene on 

colony formation by tumorigenic and non-tumorigenic cancer cells in tissue culture. 
After isolation by flow cytometry, each population of cells are infected with an 
identical MOI of either the frizzled 6 /GFP, frizzled 7/GRP or a control GFP virus. 
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Triplicate cultures of 100, 500, 1,000 and 5,000 cells are placed in tissue culture 
medium. The total number of GFP + colonies as well as the total number of colonies 
and the number of GFP + colonies are counted on days 3, 7, 14, 21 and 28. At the end 
of 21 days, we attempt to pass the cells to determine whether expression of the 
5 particular frizzled gene affects self-renewal. 

The influence of enforced expression of each frizzled gene on the ability of the 
neoplastic cells to form tumors in the NOD/SCID mice are determined. Normally, 200 
Tumor 1 cells are required to form a tumor. Therefore, the frizzled 6, frizzled 7 or a 
control GFP lentivirus are used to infect 50, 100, 500, 1,000, 5,000, and 10,000 

10 tumorigenic cancer cells or non-tumorigenic cancer cells. The cells are injected into 
the immunodeficient mice. The number of cells needed to form tumors and the rate of 
tumor growth are monitored. After the tumors have reached one centimeter in size, 
they are excised and analyzed by flow cytometry for expression of GFP. By 
comparing the percentage of cells infected by the GFP virus and frizzled /GFP virus, 

15 we are able to estimate the efficiency of infection and the affect of the latter virus on 
proliferation. These tests are replicated three times. 

Predicted Results : In Tumor 1, different populations of cells express different 
frizzled proteins, and the non-tumorigenic cells appear to preferentially express Wnt 
proteins. This suggests that certain populations of non-tumorigenic cells promote 

20 tumor formation through Wnts. If p-catenin signaling is downregulated in the non- 
tumorigenic cells and active in the tumorigenic subset, this is detected by the 
immunohistochemistry analysis of the expression patterns of phosphorylated & 
unphosphorylated P-catenin in the non-tumorigenic and tumorigenic cancer cells 
respectively. If there is no affect of the particular frizzled/GFP virus, then a similar 

25 percentage of cells would express GFP in each group and there is no difference in the 
number of cells needed to form a tumor. If the particular frizzled virus decreases or 
increases tumorigenicity or proliferation, then tumors infected with frizzled/GFP virus 
would have fewer or more GFP + cells and/or would require more or fewer cells to 
form tumors, respectively. 

30 If necessary, a Feline Leukemia Virus lentivirus based vector system is used. 

This latter vector efficiently transduces non-replicating cells, and results in prolonged 
expression of transgenes. We can infect cells using a tet-inducible dnTCF-IRES-GFP 
lentivirus flanked by gene insulators or a control GFP lentivirus. 1-2 days prior to 
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harvesting tumors, the transgene is activated. GFP + breast cancer stem cells are 
harvested and transplanted into the breasts of NOD/SCID mice. We continue to 
induce the expression of the transgene in the mice, and we are able to monitor them 
for the ability of the cells to form tumors. 
5 The P-catenin signaling pathway differs in the cancer cells isolated from 

cancers with and without constitutivelv activated fccatenjn. Rationale: Unlike 
colon cancer, mutations in the p-catenin signaling pathway have been detected in only 
a minority of breast cancer cells. However, these studies have concentrated only on 
APC and p-catenin. In this aim, we closely examine the p-catenin pathway in each of 

10 the tumors that were analyzed at the biological level in specific aim 1 A. 

Does the Wnt pathway differ in cancer cells isolated from different 
tumors ? In these experiments, we characterize the Wnt/p-catenin pathway in each of 
the tumors. To do this, we use RT-PCR to amplify the coding sequence of P-catenin, 
each of the frizzled proteins, the low-density lipoprotein-related Wnt receptors, APC, 

1 5 TCF family members, Axin, and Bcl-9 expressed by the cancer cells from each of the 
10 tumors with constitutive activation of p-catenin, RT-PCR products of the 
expressed genes are sequenced to determine whether there are mutations in any of the 
genes. Any possible mutant genes are confirmed by repeated sequencing of an 
independent RT-PCR sample. If mutations are found, we determine whether the 

20 mutations result in the constitutive activation of the Wnt/p-catenin pathway. To do 
this, the mutated gene-IRES-GFP are cloned into the pCDNA3 eukaryotic expression 
vector. For example, if we find a mutant frizzled 2, then HEK 293 cells (which do not 
have activated B-catenin 130 ) are transfected with the mutant frizzled 2-IRES-GFP 
expression vector or a control IRES-GFP vector. Cells are stained with an anti-p- 

25 catenin-PE antibody and fluorescent microscopy is done to determine whether the 
mutant frizzled 6 causes cytoplasmic/nuclear localization of p-catenin, indicating 
activation of signaling. This assay allow us to determine whether mutation of 
components of the P-catenin pathway result in aberrant signaling in human breast 
cancer stem cells. 

30 Expected results. Although constitutively active p-catenin is seen in the 

cancer cells in a significant number of breast cancer tumors, the mechanism is not 
known. There are differences in the signaling pathway in different tumor cells that are 
detected by these studies. If a mutation in a Wnt receptor or p-catenin modifier is 
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present, then the sequencing studies detect this difference. If autocrine stimulation is 
present, then we see expression of one of the Wnt ligands by the cancer cells. 

Does Wnt expression bv different populations of tumor cells in some 
tumors drive breast cancer cell growth? Perhaps more so than any other type of 
5 cancer, a breast cancer tumor contains a heterogeneous population of normal cells 
including mesenchymal (stromal) cells, inflammatory cells, and endothelial cells that 
interact with malignant cells to modulate tumor growth and invasion. The purpose is 
to begin to understand the role of the Wnt pathway in such interactions. We 
contemplate that normal stromal elements including mesenchymal and endothelial 

10 cells produce different Wnts that influence tumor cell proliferation and invasion. Just 
unpassaged tumors are analyzed since the xenograft tumors would be expected to 
have infiltrating normal mouse stromal cells and analysis of the mouse cells would be 
too complicated. Purification of these cells by flow-cytometry allow both molecular 
and biological analysis of these cells without first placing the cells in tissue culture. 

15 This is particularly important since the normal cells are known to change expression 
of genes when cultured in vitro. 

The normal stromal cells are thought to play a role in the proliferation of 
breast cancer cells. It is also likely that the cell-cell interactions between cancer cells 
contribute to tumor growth. Wnt signaling is one of the major pathways that normal 

20 tissue cells use to talk to each other. Therefore, it is important to understand how this 
pathway is regulated in tumors. Specific Wnt proteins can activate specific frizzled 
receptors. Some frizzled receptors signal through P-catenin, while others signal 
through different pathways. To understand how the various populations of tumor cells 
within a tumor might talk to the tumorigenic breast cancer cells through this pathway, 

25 we must first determine which frizzled and Wnt genes are expressed by the normal 
cells and the cancer cells from multiple patients* tumors. Therefore, we identify the 
Wnt pathway genes that are expressed by each population of normal cells and the 
cancer cells isolated from the 5 patients' tumor samples that have constitutive P- 
catenin signaling and the cancer cells from 5 patients' tumors that do not have 

30 constitutive activation of this protein. 

Since our evidence suggests that there are differences in the expression of Wnt 
and frizzled genes in the different populations of cancer cells, it is important to isolate 
the different phenotype subsets of cells in the cancer to do these tests. This is because 
the apparently tumorigenic population of cells is a minority population, and the genes 
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that these cells express might otherwise be missed in the analyses. Therefore, flow- 
cytometry is used to isolate tumorigenic and non-tumorigenic breast cancer cells, as 
well as normal endothelial cells and fibroblasts from the patients' original tumor. This 
is done as described in preliminary results and aim 1 . RNA is isolated from pools of 
5 35,000 of each population of cells and then linear amplification is done to make 
sufficient probe for the microarray analysis 131 " 135 . To determine which frizzled and 
Wnt genes are expressed by the each population of cells found in each tumor, we 
probe an affymetrix microarray chip (3 chips for each cell type) that includes the Wnt 
and frizzled genes (the newly released U133 chip has the majority of these genes). 

10 Results are confirmed by quantitative RT-PCR of the different populations of 

cancer cells isolated from the primary tumors with and without activated 0-catenin in 
the cancer cells. Real time RT-PCR is done to determine the level of expression of 
each of the frizzled and Wnt genes by different populations of normal and neoplastic 
tumor cells. To do this, we make PCR primers for detection each of these genes. Each 

15 set of primers span at least one exon so RT-PCR can be used to detect expression of 
the mRNA in different populations of tumor cells. Flow-cytometry is used to isolate 
the tumorigenic population of cells identified in each of the tumors. Real-time PCR 
then is used to measure the expression of each of the Wnt pathway-related RNAs by 
each respective cell population identified in the microarray analysis (reviewed in 136 ). 

20 To do the real-time PCR gene expression analysis, mRNA is purified from 3xl0 4 cells 
(isolated by flow-cytometry). Part of the RNA is used to directly measure RNA 
amount by the Ribogreen RNA quantitation method (Molecular Probes, Eugene, OR), 
and part used to measure rRNA and GAPDH expression (a control housekeeping 
gene) via the Taqman real-time RT-PCR assay. Taken together, these control 

25 measurements allow us to normalize expression of the genes of interest between the 
different populations of cells 136 . Although fewer cells may be used in this assay, 
analysis of RNA isolated from 3xl0 4 cells should result in a more accurate 
measurement of gene expression. 

Each frizzled receptor expressed by the different populations of cancer cells 

30 from each tumor is analyzed for the ability to activate p-catenin and transform cells 
when stimulated by each of the different Wnt genet that are expressed by different 

population! of colli within a tumor. Two biological lyitemi are uied for those studies. 

First, we use HfiK 293 cells transacted with eaoh individual mealed identified in this 
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screen to test the ability of the identified Wnts to activate P-catenin through the 
frizzled proteins expressed by the tumorigenic cells. Next, we use a mammary 
epithelial cell line to determine whether a particular Wnt or frizzled gene is able to 
transform the cell line. 
5 To measure the biochemical functions of the different Wnt and Frizzled 

proteins expressed by the breast cancer cells, we use a transient transfection assay as 
described by Gazit et al. 130 . In this assay, HEK 293T cells are transiently transfected 
with a frizzled minigene or a control minigene and aTCF-luciferase or control reporter 
minigene. To test the ability of a particular Wnt protein to stimulate p-catenin 

10 signaling, a second group of HEK 293T cells are transfected with each of the Wnt 
genes expressed by the various populations of tumor cells. The frizzled-transfected 
cells are mixed with the Wnt-transfected cells to measure paracrine activation of a 
particular frizzled receptor expressed by the breast cancer stem cells activates P- 
catenin when stimulated by a particular Wnt protein expressed by one of the various 

1 5 populations of tumor cells. 

The C57MG cell line is used to determine whether activation of particular 
frizzled receptors by particular Wnts causes morphological transformation 137 . These 
cells undergo morphologic transformation when exposed to Wnt-1, Wnt-2, Wnt-3A, 
Wnt-6 and Wnt-7A, but not Wnt4, Wnt-5A, Wnt-5B and Wnt-7B. These data suggest 

20 that the non-transforming Wnts signal differently than the transforming Wnts, or that 
they signal through different receptors not expressed by the C57MG cells. Therefore, 
to fully characterize the functions of the different frizzled and Wnt proteins expressed 
by the cancer cells in the patients' tumors, we must first determine which frizzled 
genes are expressed by the C57MG cells. The cells are transfected with minigenes 

25 that express any frizzled genes expressed by tumorigenic breast cancer cells but not 
expressed by the C57MG cells. Next, cells are cultured in the presence of lethally 
irradiated fibroblasts or HEK 293T cells transfected with individual Wnt genes that 
were expressed by the different populations of tumor cells. The cells are analyzed for 
morphological transformation as described by Shimizu 138 . 

30 Next, we characterize the in vivo response of cancer cells from the different 

patients' tumors to different Wnts made by the tumor cells. The Wnt proteins are 
often found in the extracellular matrix and difficult to prepare in soluble forms. 
Therefore, we make control HEK 293 cell lines that express each of the Wnts made 
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by the various types of tumor cells present in 2 patients' tumors. To do this, we first 
analyze HEK 293 cells to determine whether they constitutively make any of the Wnt 
proteins. Next, we stably transfect the HEK 293 cells with each of the Wnts made by 
the patients* tumor cells. To determine the affect of Wnt stimulation in the breast 
5 cancer cells by each of its ligands in vivo, 0, 10, 50, 100, 200, 500 and 1,000 Tumor 1 
stem cells are mixed with 500,000 lethally irradiated control 293 cells or 293 cells 
transfected with one or more relevant Wnt minigenes and then injected into 
immunodeficient mice. Each injection is done in five mice. The mice then be 
monitored weekly for tumor formation. If a particular Wnt stimulates self renewing 

10 cell division, then either fewer cells are needed to initiate a tumor and/or tumors form 
more quickly. Conversely, if the ligand induces commitment to differentiation, then 
more cells are required to form a tumor and/or tumors take longer to form. 

Expected results. The interaction of cancer cells with the normal stromal cells 
in tumors is thought to be critical for tumor formation and metastasis 33 . The Wnt 

1 5 pathway is one of the central pathways by which cells in normal tissues 

communicate 65 . It is therefore likely that such communications are maintained to 
some extent in tumors. The models described in this proposal for the first time enable 
such studies to be conducted using patients' tumor cells. If the stromal cells indeed 
promote tumor growth through Wnt signaling, then the various populations of stromal 

20 cells make specific Wnts that provide a proliferative signal for the tumorigenic cancer 
cells. 

To minimize these problems, all tests are done in triplicate with different 
numbers of cells. Expression of a control RNA of a known quantity is used to 
construct a standard curve to analyze the data(reviewed in 136 ). If necessary, new PCR 

25 primers are made, or RT is done with gene specific primers recognizing a different 
part of the mRNA (oligo dT primers are used for the RT reaction initially). 

Summary: These tests for the first time describe in comprehensive detail the 
molecular mechanisms by which the P-catenin pathway is activated in vivo in 
tumorigenic populations of breast cancer cells obtained directly from multiple 

30 patients' tumors, and the biological consequences of this activation in de novo breast 
cancer cells, 
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Example 3 

Localizator of p-catenin in Tumorigenic Cells 

In normal hematopoietic cells, nuclear P-catenin is found only in the stem cell 
compartment. Reya et al further demonstrate that P-catenin signaling is necessary for 
5 normal stem cells to self-renew. A recently completed analysis of the subcellular 
localization of P-catenin in tumorigenic and non-tumorigenic tumor 1 breast cancer 
cells further supports this notion. Normally, the subcellular distribution of p-catenin 
is heterogeneous in cancer cells. In some cells, the protein is located primarily in the 
outer membrane, while in others primarily in the nucleus. The subcellular distribution 

10 of the protein differs in the tumorigenic and non-tumorigenic cancer cells. The P- 
catenin is primarily located in the cytoplasm of the non-tumorigenic cancer cells, 
while it is primarily in the nucleus of the tumorigenic cells (Fig. 8). Since upon 
activation by a Wnt signal, P-catenin translocates from the cell membrane to the 
nucleus to activate downstream target genes, this data supports the hypothesis that 

15 Wnt signaling plays a role in the self-renewal of breast cancer stem cells. 

Figure 8 shows subcellular localization of p-catenin. A FITC labeled anti-P- 
catenin antibody was used to stain (A) colon cancer cells, which have a constitutively 
activated P-catenin, (B) non-tumorigenic Tl breast cancer cells, and (C) tumorigenic 
breast cancer cells. The tumorigenic and non-tumorigenic cancer cells were isolated 

20 by flow cytometry as described in the PNAS manuscript by Al-Hajj et aL Note that 
the P-catenin is located primarily in the nucleus of the colon cancer cells and the 
breast cancer stem cells, but it is primarily located on the surface of the non- 
tumorigenic cells. 

To begin to understand the biological consequences of p-catenin signaling in 
25 breast cancer, we have tested our dominant negative TCF-4 (dTCF4) adenovirus 
vector in several cell lines. This adenovirus acts to inhibit p-catenin signaling. Two 
different breast cancer cell lines, SKBR3 and MCF7, and a gastrointestinal tract 
cancer cell line, RKO, were infected with the dTCF4 adenovirus or a control 
adenovirus (empty vector). Four days after infection, the number of viable cells in 
30 each group was determined. As shown in Figure 9, the breast cancer cells infected 

with the dTCF4 adenovirus, but not the control adenovirus, died. These data show that 
the Wnt pathway does play a role in human breast cancer. 
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Figure 9 shows inhibition of P-catenin signaling in cancer cells. Triplicate 
cultures of SKBR3 cells (A), MCF7 cells (B) and RKO cells (C) were infected with 
either an control adenovirus (empty vector) or an adenvovirus vector that expresses a 
dominant-negative TCF4 minigent (dTCF4). With increasing virus concentrations, 
5 SKBR3 cells and MCF7 cells, but not RKO cells, lost viablility. Note that the virus 
titers resulting in cell death were those needed to efficiently infect most of the target 
cells with a control GFP virus (data not shown). This experiment has been repeated 
with similar results. 

The observation that p-catenin is located primarily in the nucleus in the tumorigenic 
1 0 but not the non-tumorigenic cancer cells taken together with the observation that 

inhibition of P-catenin signaling affects the viability of some breast cancer cell lines 
shows that like normal stem cells, Wnt signals may play a role in the self-renewal of 
cancer stem cells. 

1 5 Example 4 

Identifying Stem Cell Cancer Markers 

This Example describes how various stem cell cancer markers were identified 
using microarray screens. The results of these screens were processed and the names 
of the differentially expressed genes are reported in Tables 4-8 (see above). 

20 In order to generate gene expression profiles, human breast tumorigenic cells 

which were initially isolated. A series of samples were accumulated from human 
breast tumors or normal tissues. These were generated as follows. Three passaged 
breast tumors — breast tumor cells from patient 1, 2, 3 were engrafted on miceach 
tumor was engrafted on three mice to make the triplicate tumors. The breast 

25 tumorigenic cells were then isolated from these tumors. Two or three unpassaged 
breast tumors from three patients SUM, PE13, PE15 were labeled and sorted into 
tumorigenic cells (TG) or non-tumorigenic cells (NTG). Both PE15-TG and PE15- 
NTG were triplicate. Two or three normal breast samples were from breast reduction 
patients. Breast epithelial cells (Breast) were isolated with flow cytometry and used 

30 for microarray. Two or three normal colon samples were collected freshly from colon 

patients. Colon epithelial cells (Colon) were isolated with flow cytometry and used 

for microarray. Two or three normal stem cell samples (normal bone marrow) were 
collected from bone marrow donors. Hematopoietic stem cells (HSC) were isolated 
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with flow cytometry. Probes were made from the following were made from the 
various cells types for use in the mircoarray analysis. 

In order to perform the various microarray screens Affymetrix HG-U133 gene 
chips were used. The normalized gene expression intensity was used to generate the 

5 data that was collected in a number of large tables. The results in these tables was 
processed and used to generate Tables 4, 5, 6, 7a, 7b, 7c, 7d and 8, which present the 
names of the genes found to be differentially expressed. For tables 4-6, candidate 
cancer markers were sorted by identifying genes whose expression was greater or less 
than 1.5 fold in unpassaged breast tumorigenic cells comparing to non-tumorigenic 

10 cells or the normal stem cells (HSC). Table 6 shows only those genes found to be 

down regulated in UPTG vs. UPNTG. Table 5 shows only those genes found to be up 
regulated in UPTG vs. HSC. Table 4 shows only those genes found to be up 
regulated in UPTG vs UPNTG. For tables 7a, 7b, 7c and 7d, cancer markers were 
were generated from the larger tables by standard T-test. These tables were sorted 

15 based on T-score is <0.01 and ratio is more than 2 fold. Table 7a shows only those 
genes found to be up regulated in UPTG vs. HSC. Tables 7b shows only those genes 
found to be down regulated in UPTG vs. HSC. Table 7c shows only those genes 
found to be up regulated in PTG vs HSC. Table 7d shows only those genes found to 
be down regulated in PTG vs HSC. 

20 
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CLAIMS 

We claim: 

1 . A method of detecting solid tumor stem cells, comprising; 
5 a) providing a tissue sample from a subject, and 

b) detecting at least one stem cell cancer marker from Tables 4-8 
in said tissue sample under conditions such that the presence or absence of 
solid tumor stem cells in said tissue sample is determined. 

10 2. The method of Claim 1 , wherein said detecting comprises determining 

an expression level for said at least one stem cell cancer marker. 

3. The method of Claim 1, wherein said detecting comprises detecting 
mRNA expression of said at least one stem cell cancer marker. 

15 

4. The method of Claim 1, wherein said detecting comprises detecting 
polypeptide expression of said at least one stem cell cancer marker. 

5. The method of Claim 1, wherein said subject comprises a human 

20 subject. 

6. The method of Claim 1 , wherein said tissue sample comprises tumor 

tissue. 

25 7. The method of Claim 1 , further comprising the step of c) providing a 

prognosis to said subject. 

8. The method of Claim 1, wherein said at least one stem cell cancer 
marker is from Table 8. 

30 

9. The method of Claim 1 , wherein said at least one stem cell cancer 
marker is selected from the group consisting of: Bmi-1, eed, easyhl, easyh2, rnf2, 
yyl, smarcA3, smarcA5, smarcD3, smarcEl, mllt3, FZD1, FZD2, FZD3, FZD4, 
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FZD6, FZD7, FZD8, FZD9, FZD 1 0, WNT2, WNT2B, WNT3, WNT5A, WNT1 OB, 
WNT16, AXIN1, BCL9, MYC, and (TCF4). 

10. A method for reducing the size of a solid tumor comprising contacting 
5 cells of a solid tumor with a biologically effective amount of a composition 

comprising at least one agent directed against at least one stem cell cancer marker 
shown in Tables 4-8. 

1 1 . The method of claim 1 0, wherein said biologically effective amount is 
10 an amount sufficient to cause cell death of or inhibit proliferation of solid tumor stem 

cells in said solid tumor. 



15 



12 . The method of claim 10, wherein said at least one agent is an antibody, 
peptide or small molecule. 

1 3. The method of claim 1 2, wherein the antibody, peptide or small 
molecule is directed against the extracellular domain of said at least one stem cell 
cancer marker. 



20 14. The method of claim 1 0, wherein said at least one stem cell cancer 

marker is selected from the group consisting of: Bmi-1, eed, easyhl, easyh2, rn£2, 
yyl, smarcA3, smarcA5, smarcD3, smarcEl, mllt3, FZD1, FZD2, FZD3, FZD4, 
FZD6, FZD7, FZD8, FZD9, FZD10, WNT2, WNT2B, WNT3, WNT5A, WNT10B, 
WNT16, AXIN1, BCL9, MYC, and (TCF4). 

25 

15. A method for killing or inhibiting the proliferation of solid tumor stem 
cells comprising contacting said solid tumor stem cells with a biologically effective 
amount of a composition comprising at least one agent targeted to at least one stem 
cell cancer marker shown in Tables 4-8. 

30 

16. The method of claim 1 5, further comprising identifying the death of or 
the prevention of the growth of said solid tumor stem cells following said contacting. 



17. The method of Claim 16, wherein said cell death is apoptosis. 
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18. The method of claim 15, wherein said at least one stem cell cancer 
marker is selected from the group consisting of: Bmi-1, eed, easyhl, easyh2, mf2, 
yyl, smarcA3, smarcAS, smarcD3, smarcEl, mllt3, FZD1, FZD2, FZD3, FZD4, 

5 FZD6, FZD7, FZD8, FZD9, FZD10, WNT2, WNT2B, WNT3, WNT5A, WNT10B, 
WNT16, AXIN1, BCL9, MYC, and (TCF4). 

19. The method of claim 15, wherein said solid tumor stem cells express 
cell surface marker CD44, ESA, or B3 8. 1 . 



10 



15 



20. Amethod of distinguishing tumorigenic from non-tumorigenic cancer 
cells, comprising: detecting the presence of P-catenin in a cancer cell such that the 
localization of p -catenin in the cancer cell is determined to be primarily nuclear or 
primarily cytoplasmic. 

2 1 . The method of Claim 20, further comprising identifying the cancer cell 
as tumorigenic if the P -catenin localization is primarily nuclear, or identifying the 
cancer cell as non-tumorigenic if the P -catenin localization is primarily cytoplasmic. 
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